Abstract

We examine the prospects for extending the Tevatron reach for a Standard Model Higgs boson
by including the semileptonic Higgs boson decays $h \to WW \to \ell\nu_\ell\,jj$ for $M_h > 2M_W$, and
$h \to Wjj \to \ell\nu_\ell\,jj$ for $M_h < 2M_W$, where $j$ is a hadronic jet. We employ a realistic simulation of
the signal and backgrounds using the Sherpa Monte Carlo event generator. We find kinematic selections
that enhance the signal over the dominant W+jets background. The resulting sensitivity could
be an important addition to ongoing searches, especially in the mass range $120 < M_h < 150$ GeV.
The techniques described can be extended to Higgs boson searches at the Large Hadron Collider.
1 Introduction

The Standard Model (SM) predicts a neutral Higgs boson particle whose couplings to other particles 
are proportional to the particle masses, and that couples to photons and gluons via one-loop-generated 
effective couplings. While the Higgs boson mass is not predicted, the relation between the Higgs boson 
mass and its width is fixed from the predicted couplings. Virtual Higgs boson contributions to electroweak 
precision observables have been computed, and precision data favor $M_h < 169$ GeV at 95% confidence
level [1]. Searches at the CERN Large Hadron Collider have produced 95% confidence level exclusion of
a SM Higgs boson for a broad mass range above 145 GeV [2, 3].

Because of small signal cross sections and large backgrounds, the search for the Higgs boson in experiments
at the Fermilab Tevatron is very challenging, even with the large final datasets approaching 10 fb$^{-1}$
per experiment. Nevertheless, both the CDF and D0 experiments have achieved steady improvements in
their sensitivities in multiple channels to a SM Higgs boson, and their individual results already exclude a
SM Higgs boson in the mass range 156.7-173.8 GeV and 162-170 GeV, respectively, at 95% confidence
level [4, 5, 6]. This exclusion relies mainly on sensitivity to the dilepton final-state decay chain analyzed
in Refs. [7, 8, 9]: $h \to W^+W^- \to \ell^+\nu_\ell\,\ell^-\bar\nu_\ell$, where $\ell = e$ or $\mu$.

Here we examine the prospects for extending the Tevatron reach by including a search for the semileptonic
Higgs boson decays $h \to WW \to \ell\nu_\ell\,jj$ for $M_h > 2M_W$, and $h \to Wjj \to \ell\nu_\ell\,jj$ for
$M_h < 2M_W$, where $j$ is a hadronic jet. This process was first considered as a potential Higgs boson
discovery channel for the SSC [10, 11, 12, 13], emphasizing the case of a very heavy Higgs boson, where
the $h \to ZZ \to 4\ell$ "golden mode" becomes limited by its small branching fraction and the broad Higgs
boson width. Similar to the golden mode, the semileptonic $h \to WW$ modes are (almost) fully reconstructible:
assuming that the leptonic W is close to on-shell, the mass constraint gives an estimate of the
unmeasured longitudinal momentum of the neutrino, up to a two-fold ambiguity [13]. For $M_h > 140$ GeV
the overall decay rate is 6 times larger than any other SM Higgs boson decay mode with a triggerable lepton.
Including these semileptonic channels thus offers the distinct possibility of significantly extending
the Tevatron reach over a rather broad mass range.

This channel suffers from large backgrounds from SM processes with a leptonically decaying W boson.
These include diboson production, top quark production, and direct inclusive W+2-jet production. There
is also a purely QCD background that is difficult to estimate absent a dedicated analysis with data. The
dominant background is inclusive W+2-jets; from this background alone we have estimated a signal to
background ratio (S/B) of 3 x 10~^, after nominal preselections. Though worrisome, this is not smaller 
than the analogous S/ B ^ A x 10~^ for the e+e~ and ii~ modes after preselection in the successful 
Tevatron analyses of $h \to W^+W^- \to \ell^+\nu_\ell\,\ell^-\bar\nu_\ell$ [14, 15, 16, 17, 18, 19].

A drastic reduction in both the W+2-jet and diboson backgrounds to semileptonic Higgs boson decay
can be achieved by forward jet tagging, i.e. by restricting to Higgs boson production from vector boson
fusion (VBF) [10, 20]: it is estimated that the additional requirement of forward jet tagging then gives
a factor of ~100 reduction in these backgrounds. However, the reduction in the Higgs signal, versus
inclusive Higgs boson production, is also severe: a factor of ~10 at the Tevatron [21]. Looking at the
similar trade-off for the dilepton $h \to W^+W^- \to \ell^+\nu_\ell\,\ell^-\bar\nu_\ell$ channel, a Tevatron study [22] concluded
that the overall sensitivity does not improve by restricting to VBF Higgs boson versus inclusive Higgs
boson production. We do not know of any comparable analysis for the semileptonic channel.

For inclusive Higgs boson production at the Tevatron, the semileptonic channels were first studied by 
Han and Zhang [8, 9]. In a parton-level study with some jet smearing they found that, after basic acceptance
cuts together with a veto on extra energetic jets designed to suppress the $t\bar t$ background, the
remaining background is completely dominated by W+2-jets. Han and Zhang then made additional kinematic
selections that enhance the signal to background ratio S/B. For $M_h = 140$ (160) GeV they thus
obtained a significance estimate of $S/\sqrt{B} = 1.0$ (3.3) for 30 fb$^{-1}$ of integrated Tevatron luminosity. The
fully differential Higgs boson decay width for this process was exhibited by Dobrescu and Lykken [23],
who analyzed the basic kinematics and angular distributions that characterize the Higgs signal.

We improve on these studies by including realistic parton showering (since parton-level jet smearing
is inadequate), an NLO-rate improved treatment of the Higgs boson decays (including off-shell effects),
and a resummed NNLO estimate of the $gg \to h$ production cross section. The first two improvements
are incorporated by the use of Sherpa [24, 25], a general purpose showering Monte Carlo program, for
simulation of both the signal and the inclusive W+2-jets background. The NNLO signal cross section is
modeled by a K-factor.

Our purpose is to study these semileptonic Higgs boson decay channels in a systematic way, but not to 
mimic a fully-optimized experimental analysis. The D0 experiment has already reported on a semileptonic
Higgs boson search using 5.4 fb$^{-1}$ of Tevatron data [26, 27]; this analysis uses multivariate decision
trees to enhance the significance of the result. Here we will limit ourselves to simple cuts, in order to
make the features of the analysis and the underlying physics more explicit. We study the Higgs signal in 
the mass range 110-220 GeV to reasonably cover the below, near and above threshold regions for Higgs 
boson decay to two on-shell W bosons. 

In Section 2 we outline the strategy and define several useful observables. In Section 3 we discuss
inclusive Higgs boson production from the dominant gluon-gluon fusion mechanism, and implement a
K-factor correction to the Sherpa result. In Section 4 we introduce basic preselections and develop cuts
implemented in Sherpa to enhance $S/\sqrt{B}$; this section also contains our main results. We conclude in
Section 5 with caveats about the limitations of our analysis and suggestions for further improvements.
Cross-checks and additional material are presented in the appendices.

2 Strategy and key observables 

We are interested in the Higgs boson decay

$$h \to W^{(*)}W^{(*)} \to e\nu_e\,jj \qquad (1)$$

and the similar decay with a muon in the final state. In general we take both W bosons off shell. We will
write $e\nu_e\,jj$ as $e\nu_e\,jj'$, where $j$ is the jet with higher transverse momentum ($p_T$), noting that physical
observables will be symmetric under $j \leftrightarrow j'$. We use a baseline selection adopted from the D0 analysis to
define reconstructed jets and leptons and impose realistic acceptance cuts. We will assume that events with
more than one reconstructed lepton are vetoed, but we want to allow the possibility of extra jets in order to
increase signal efficiency. When more than two jets are present there is a combinatorial problem; we will
define a Higgs boson candidate selection algorithm that assigns which two jets to use in the Higgs boson
reconstruction; these jets may or may not correspond to the two leading jets in the event. It is important
to note that this algorithm is chosen so as to optimize the signal sensitivity after the full selection, which
is not equivalent to maximizing the number of correctly reconstructed signal events.
When the leptonically decaying W boson is (close to) on-shell, these decays are fully reconstructible
up to a two-fold ambiguity in the neutrino momentum without making any assumption about the Higgs 
boson mass. Here we are assuming that the transverse momentum of the neutrino is well estimated given 
a measurement of the missing transverse energy (MET), as has been demonstrated by both Tevatron ex- 
periments in the determination of the W boson mass. 
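The two-fold ambiguity mentioned above can be made concrete with a short sketch. It assumes a massless charged lepton and neutrino, takes the neutrino transverse momentum directly from the MET, and imposes $(p_\ell + p_\nu)^2 = M_W^2$; the function names and the value $M_W = 80.4$ GeV are illustrative choices, not taken from the analysis itself.

```python
import math

M_W = 80.4  # GeV; nominal W boson mass, an assumed input value


def neutrino_pz_solutions(lepton, met, m_w=M_W):
    """Solve the on-shell W mass constraint for the neutrino p_z.

    lepton: (px, py, pz) of the charged lepton, treated as massless
            (its transverse momentum must be non-zero).
    met:    (px, py) of the missing transverse momentum.
    Returns the two real solutions, or a single real-part fallback
    when the discriminant is negative (transverse mass above m_w).
    """
    lx, ly, lz = lepton
    nx, ny = met
    pt_l2 = lx * lx + ly * ly
    pt_n2 = nx * nx + ny * ny
    e_l = math.sqrt(pt_l2 + lz * lz)
    a = 0.5 * m_w * m_w + lx * nx + ly * ny
    disc = a * a - pt_l2 * pt_n2
    if disc < 0.0:
        return (a * lz / pt_l2,)
    root = e_l * math.sqrt(disc)
    return ((a * lz + root) / pt_l2, (a * lz - root) / pt_l2)


def lepton_neutrino_mass(lepton, met, nu_pz):
    """Invariant mass of the massless lepton-neutrino pair."""
    lx, ly, lz = lepton
    nx, ny = met
    e_l = math.sqrt(lx * lx + ly * ly + lz * lz)
    e_n = math.sqrt(nx * nx + ny * ny + nu_pz * nu_pz)
    m2 = 2.0 * (e_l * e_n - lx * nx - ly * ny - lz * nu_pz)
    return math.sqrt(max(m2, 0.0))
```

Both returned solutions reproduce the assumed W mass; choosing between them requires further input, which is the ambiguity referred to in the text.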

The semileptonic channel's advantage of being, in principle, completely reconstructible offers a powerful
way to separate signal from backgrounds. However, when the leptonically decaying W boson is far off
shell, a straightforward full reconstruction is not possible. There are then three generic possibilities for
how to proceed:

• Use only transverse observables. 

• Perform an approximate event-by-event reconstruction using an estimate of the off-shell 
W boson mass. 

• Perform an approximate event-by-event reconstruction using a (hypothesized) Higgs 
boson mass constraint. 

Since it is not clear a priori which of these approaches maximizes the Higgs boson sensitivity, we will 
pursue all three and compare the results. 

Given an event-by-event approximate combinatorial full reconstruction of the putative decaying Higgs 
boson, one can approximately reproduce the kinematics in the Higgs boson rest frame. The true Higgs 
boson rest frame is given by a longitudinal boost from the lab frame together with a transverse boost 
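defined by the transverse momentum $p_{T,h}$ of the Higgs boson. An explicit representation for the four-momenta
in the Higgs boson rest frame is given by:

$$
\begin{aligned}
p_e &= \tfrac{1}{2}\, m_{e\nu_e} \left( \gamma_{e\nu_e}(1 + \beta_{e\nu_e}\cos\theta_\ell),\; \sin\theta_\ell\cos\varphi_\ell,\; \sin\theta_\ell\sin\varphi_\ell,\; \gamma_{e\nu_e}(\beta_{e\nu_e} + \cos\theta_\ell) \right), \\
p_{\nu_e} &= \tfrac{1}{2}\, m_{e\nu_e} \left( \gamma_{e\nu_e}(1 - \beta_{e\nu_e}\cos\theta_\ell),\; -\sin\theta_\ell\cos\varphi_\ell,\; -\sin\theta_\ell\sin\varphi_\ell,\; \gamma_{e\nu_e}(\beta_{e\nu_e} - \cos\theta_\ell) \right), \\
p_j &= \tfrac{1}{2}\, m_{jj'} \left( \gamma_{jj'}(1 + \beta_{jj'}\cos\theta_j),\; \sin\theta_j,\; 0,\; -\gamma_{jj'}(\beta_{jj'} + \cos\theta_j) \right), \\
p_{j'} &= \tfrac{1}{2}\, m_{jj'} \left( \gamma_{jj'}(1 - \beta_{jj'}\cos\theta_j),\; -\sin\theta_j,\; 0,\; -\gamma_{jj'}(\beta_{jj'} - \cos\theta_j) \right), \qquad (2)
\end{aligned}
$$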

where we have chosen the dijet plane to coincide with the x-z plane, and have chosen the positive z-axis to
be the direction of the leptonically decaying W boson. The boost factors of the two W bosons relative to
the Higgs boson rest frame are given by

$$\gamma_{jj'} = \frac{M_h}{2 m_{jj'}}\left(1 + \frac{m_{jj'}^2 - m_{e\nu_e}^2}{M_h^2}\right), \qquad
\gamma_{e\nu_e} = \frac{M_h}{2 m_{e\nu_e}}\left(1 + \frac{m_{e\nu_e}^2 - m_{jj'}^2}{M_h^2}\right), \qquad (3)$$

and we note the identity

$$m_{jj'}\,\beta_{jj'}\,\gamma_{jj'} = m_{e\nu_e}\,\beta_{e\nu_e}\,\gamma_{e\nu_e} . \qquad (4)$$



Note that $\theta_j$ is the angle between jet $j$ and the direction of the hadronic W boson, as seen in the W rest
frame, while $\theta_\ell$ is the angle between the charged lepton and the direction of the leptonic W boson as seen
in the W rest frame. The azimuthal angle $\varphi_\ell$ is the angle between the dilepton and dijet planes. From Eq. (2)
we can calculate the angle $\theta_{jj'}$ between the two jets as seen in the Higgs boson rest frame:

$$\cos\theta_{jj'} = 1 - \frac{2\,(1 - \beta_{jj'}^2)}{1 - \beta_{jj'}^2\cos^2\theta_j} . \qquad (6)$$

Signal events have a minimum opening angle between the jets as seen in the Higgs boson rest frame:

$$-1 \le \cos\theta_{jj'} \le 2\beta_{jj'}^2 - 1 . \qquad (7)$$
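The bound on the jet opening angle can be checked numerically. The sketch below assumes that the two jets are massless and that their four-momenta in the Higgs boson rest frame take the form (E, px, py, pz) with the hadronic W boson moving along the negative z-axis, as in the parametrization of Eq. (2); the function names are illustrative.

```python
import math


def jet_momenta(m_jj, gamma, cos_theta_j):
    """Four-momenta (E, px, py, pz) of the two massless jets from a W
    of mass m_jj boosted by gamma along the negative z-axis."""
    beta = math.sqrt(max(0.0, 1.0 - 1.0 / gamma**2))
    c = cos_theta_j
    s = math.sqrt(max(0.0, 1.0 - c * c))
    half = 0.5 * m_jj
    p_j = (half * gamma * (1.0 + beta * c), half * s, 0.0,
           -half * gamma * (beta + c))
    p_jp = (half * gamma * (1.0 - beta * c), -half * s, 0.0,
            -half * gamma * (beta - c))
    return p_j, p_jp


def cos_opening_angle(p, q):
    """Cosine of the angle between the three-momenta of p and q."""
    dot = sum(a * b for a, b in zip(p[1:], q[1:]))
    norm_p = math.sqrt(sum(a * a for a in p[1:]))
    norm_q = math.sqrt(sum(b * b for b in q[1:]))
    return dot / (norm_p * norm_q)
```

Scanning over the decay angle shows that the largest value of the cosine is reached for a decay perpendicular to the boost direction, where it equals $2\beta_{jj'}^2 - 1$.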



In the approximate reconstructions that we will employ in our analysis, the Higgs boson mass $M_h$
is approximated by a 4-object invariant mass $m_{e\nu_e jj'}$. The transverse momentum of the Higgs boson is
approximated by the 4-object transverse momentum $p_{T,e\nu_e jj'}$. The dijet boost $\gamma_{jj'}$ defined in the Higgs
boson rest frame is approximated by $\gamma_{jj'|e\nu_e}$, which is the dijet boost defined in the 4-object rest frame,
in which we can compute this boost factor via $\gamma_{jj'|e\nu_e} = E_{jj'}/m_{jj'}$. This is equivalent to using Eq. (3),
where one inserts for each invariant mass its reconstructed counterpart.
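As a cross-check of this equivalence, the following sketch evaluates the boost factors from standard two-body decay kinematics, assuming $E_{jj'} = (M_h^2 + m_{jj'}^2 - m_{e\nu_e}^2)/(2M_h)$ in the parent rest frame. The function names are illustrative, and the expressions only make sense when the parent mass is at least the sum of the two subsystem masses.

```python
import math


def w_boost_factors(m_h, m_jj, m_lv):
    """Boost factors of the dijet and lepton-neutrino systems in the
    rest frame of a parent of mass m_h (two-body kinematics)."""
    gamma_jj = m_h / (2.0 * m_jj) * (1.0 + (m_jj**2 - m_lv**2) / m_h**2)
    gamma_lv = m_h / (2.0 * m_lv) * (1.0 + (m_lv**2 - m_jj**2) / m_h**2)
    return gamma_jj, gamma_lv


def beta_from_gamma(gamma):
    """Velocity corresponding to a boost factor gamma >= 1."""
    return math.sqrt(max(0.0, 1.0 - 1.0 / gamma**2))


def dijet_boost_from_energy(e_jj, m_jj):
    """Boost factor from the dijet energy measured in the 4-object rest frame."""
    return e_jj / m_jj
```

For two subsystems of equal mass the dijet boost reduces to the parent mass divided by twice the subsystem mass, and in general the two energies add up to the parent mass while the two momenta balance.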

All three of these observables discriminate between Higgs signal and backgrounds. As seen in Figure 1
(top left), $m_{e\nu_e jj'}$ for signal events peaks strongly near the true Higgs boson mass, with a width determined
primarily by parton shower effects. Thus a simple mass window selection significantly enhances the
signal, and since we are interested in Higgs boson exclusion there is no "look-elsewhere" effect associated
with imposing mass windows [28]. Note that the backgrounds are not necessarily flat in the mass windows:
as seen in Figure 1, the dominant W+jets background is rather flat in the high mass window, but is steeply
rising in the lower mass window because of the underlying kinematics. In Figure 1 (top right) one sees
that the 4-object transverse momentum $p_{T,e\nu_e jj'}$ has a harder spectrum for Higgs signal events than for the
W+jets backgrounds, independent of the Higgs boson mass.

The reconstructed dijet boost $\gamma_{jj'|e\nu_e}$ has qualitatively different behaviour depending on the underlying
Higgs boson mass. When the Higgs boson mass is close to $2M_W$, the distribution of the dijet boost for
signal events is strongly peaked near one compared to the distribution for W+jets, as seen in Figure 1
(lower left). For larger Higgs boson masses the signal distribution of the dijet boost is instead rather
strongly peaked around the value $M_h/(2M_W)$, as expected from Eq. (3).

Other physical observables of interest for signal versus background discrimination are defined directly
in the lab frame. These include the 4-object pseudo-rapidity $\eta_{e\nu_e jj'}$, the pseudo-rapidity difference of
the two jets, $\Delta\eta_{j,j'}$, and the scalar sum of the two selected jet transverse momenta, $H_{T,jj'}$. As seen in






Combinatorial Higgs boson candidate selection using mass windows. 
Figure 1: Examples of observables discriminating between Higgs signal and (the dominant W+jets) backgrounds.
Distributions are shown from the upper left to the lower right, for the reconstructed masses of the $\{e, \nu_e, j, j'\}$ final
states, their transverse momenta, the boost of the dijet subsystem with respect to the parent system and the scalar sum
of the $p_T$ of the two jets $j$ and $j'$. All results are obtained after the combinatorial Higgs boson candidate selection
based on an ideal mass reconstruction of the potential resonance. The selection is facilitated by symmetric mass
window constraints for both the 4-object and dijet mass: $|m_{e\nu_e jj'} - M_h| < \Delta$ and $|m_{jj'} - M_W| < \delta$, respectively.
More detailed explanations will follow in Section 4. Note that the background differential cross sections are scaled
down by the respective factors indicated in the legends.

Figure 1 (lower right), the distribution of $H_{T,jj'}$ for signal events is harder than for the W+jets background,
independent of the Higgs boson mass. From the distributions shown in Figure 12 one sees that the dijet
pseudo-rapidity difference has a maximum at zero for signal events (which tend to be confined to the
central region), but peaks at a larger value for the W+jets background. Similarly, from Figure 13 one notes
that the 4-object pseudo-rapidity distribution is more central for signal than for the W+jets background.

We will also employ various transverse masses, both as signal discriminators and as inputs to the algorithms
for the approximate reconstruction of the Higgs boson candidates. One class of $m_T$ observables is
solely constructed out of transverse degrees of freedom, $\vec p_{T,i} = (p_{x,i}, p_{y,i})$; we define these $m_T$ observables
as

$$m_{T,ij..} = \sqrt{\left(|\vec p_{T,i}| + |\vec p_{T,j}| + \ldots\right)^2 - \left(\vec p_{T,i} + \vec p_{T,j} + \ldots\right)^2} \;\le\; m_{ij..} , \qquad (8)$$

where, for the purposes of this study, the labels ij.. refer to two, three, or four final-state physics objects 
(charged lepton, MET, and the two selected jets). We also investigate how our results change when we 
adopt a slightly different definition that includes the full information from the invariant mass of the visible 
subset: 

$$\left(m^{(k..)}_{T,il..}\right)^2 = m_{il..}^2 + 2\left(E_{T,il..}\,|\vec p_{T,k..}| - \vec p_{T,il..}\cdot\vec p_{T,k..}\right) \;\le\; m_{ik..l..}^2 , \qquad (9)$$

where $E_{T,il..}^2 = \vec p_{T,il..}^{\,2} + m_{il..}^2$. Here, we have separated the event into a "visible" (il..) and "invisible" (k..)
part. The transverse masses are all approximately bounded from above by a kinematic edge; this gives us
another handle when fully reconstructing the event. Schematically, we have

$$m_{T,i[k..]l..} \;\le\; m^{(k..)}_{T,il..} \;\le\; m_{ik..l..} , \qquad (10)$$

where $m_{T,i[k..]l..}$ just indicates that in the presence of multiple invisible objects the [k..] subsystem enters
as a whole when computing Eq. (8). In the 2-particle case, the two transverse-mass definitions coincide
provided the single objects are massless.
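A minimal sketch of the purely transverse definition, assuming it takes the form $m_T^2 = (\sum_i |\vec p_{T,i}|)^2 - |\sum_i \vec p_{T,i}|^2$ and that all objects are massless, illustrates the upper bound by the invariant mass. The helper names are illustrative.

```python
import math


def transverse_mass(pts):
    """m_T from transverse momenta only; pts is a list of (px, py)."""
    scalar_sum = sum(math.hypot(px, py) for px, py in pts)
    sum_x = sum(px for px, _ in pts)
    sum_y = sum(py for _, py in pts)
    return math.sqrt(max(0.0, scalar_sum**2 - sum_x**2 - sum_y**2))


def massless(p3):
    """Promote a three-momentum (px, py, pz) to a massless (E, px, py, pz)."""
    px, py, pz = p3
    return (math.sqrt(px * px + py * py + pz * pz), px, py, pz)


def invariant_mass(momenta):
    """Invariant mass of a list of four-momenta (E, px, py, pz)."""
    e = sum(p[0] for p in momenta)
    px = sum(p[1] for p in momenta)
    py = sum(p[2] for p in momenta)
    pz = sum(p[3] for p in momenta)
    return math.sqrt(max(0.0, e * e - px * px - py * py - pz * pz))
```

When every object has vanishing longitudinal momentum the two quantities coincide, which is the origin of the kinematic edge.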

Given the above arsenal of kinematic discriminators and approximate reconstruction techniques, our 
basic strategy will be to find the most promising combinations of selections as a function of the Higgs 
boson mass. Since we are only performing a cut and count analysis, and are lacking a realistic detector 
description, there is no point in attempting a complete optimization. Instead we will concentrate on 
providing a comprehensive look at the physics that distinguishes signal from background. 



3 Inclusive cross sections and event generation 

We use the multi-purpose Monte Carlo event generator Sherpa [24, 25] to pursue our analysis of semileptonic
Higgs boson decays in Higgs boson production via gluon fusion. This way we can easily include all
(hard and soft) initial-state radiation and final-state radiation (ISR and FSR) effects and arrive at a fairly
realistic description of final states as used for detector simulations. Furthermore, sophisticated cuts can
be implemented in a straightforward manner owing to the convenient analysis features that come with the
Sherpa package. We also want to make use of Sherpa's capabilities in providing an enhanced modeling
of multi-jet final states with respect to a treatment by parton showers only. Apart from a handful of
key processes, the Sherpa Monte Carlo program evaluates cross sections at leading order (tree level), utilizing
its integrated automated matrix-element generators AMEGIC++ [29] and/or Comix [30]. However,
in a number of studies, Sherpa has been shown to generate predictions that are in sufficient, often good,
agreement with the shapes of kinematic distributions obtained from measurements as well as higher-order
calculations; we will give more details in the respective subsections that follow. Hence, we wish to
identify appropriate constant K-factors between the most accurate theory results and the leading-order
predictions. We in turn want to apply these K-factors to correct Sherpa's predictions for the inclusion
of exact higher-order rate effects. Therefore, we study signal and background fixed-order and resummed
cross sections at Tevatron Run II energies for the processes



leading to final states consisting of an isolated lepton, missing transverse energy and at least two jets.
The label $\ell$ denotes electrons, $e$, and muons, $\mu$; the parton label $p$ contains light/massless-quark flavours
and gluons. Note that final-state gluons may only occur in background hard processes or through the
inclusion of I/FSR effects. For the respective K-factors, it is not clear a priori at which level of cuts
they are defined most accurately. The most convenient definition should be given in terms of the total
inclusive cross sections, while one may define more exclusive K-factors, if the higher-order tools allow
for the specification of the desired cuts. We do not expect a strong dependence on the exact K-factor
definition provided the shapes are comparable. These issues are examined in more detail below with the
goal of determining reasonable signal and background K-factors that can be used to rescale the respective
leading-order cross sections $\sigma^{(0)}$ of the Sherpa predictions, which we take to pursue our signal versus
background studies.

3.1 Standard Model Higgs boson production and decay 

The signal processes for the final states of interest are summarized by 



where the Higgs particles are produced through gluon-gluon fusion and decay into W boson pairs that
split further into the desired semileptonic final states. There are other Higgs boson production mechanisms
that can contribute. In particular, the associated production $p\bar p \to Vh$ with the additional vector boson
$V$ decaying hadronically and the production via vector-boson fusion (VBF) ought to be mentioned in this
context. For all production channels, up-to-date theory predictions for the total inclusive cross sections
of these events are needed to arrive at reliable acceptance estimates for various Higgs boson masses.
Refs. [31, 32] give the most recent overview of the theory calculations and results that are used as
input for the ongoing Tevatron (and LHC) SM Higgs boson searches. For the Higgs boson masses we
are interested in, it is appropriate to separate the Higgs boson production from its subsequent decays
and multiply the production rates by the respective branching fractions, which we obtain from Hdecay
[33, 34, 35, 36] for $h \to W^*W^*$ and the Particle Data Group (PDG) listings [37] for the subsequent
decays of the W bosons.

For our main production channel, the Standard Model Higgs boson production via gluon fusion, we
want to use the most precise theoretical predictions that have become available over the last few years;
for a review, we refer to Ref. [32]. Using an effective theory approach, this production channel is
known at NNLO including electroweak and mixed QCD-electroweak contributions [38, 39, 40, 41]. For
a wide range of Higgs boson masses, these NNLO cross sections have been shown to reproduce the latest
results obtained from soft-gluon resummation up to NNLL accuracy, cf. Refs. [42, 43, 44]. To mimic the
resummation effects, the optimal scale choice at NNLO is found to be $\mu_F = \mu_R = M_h/2$, while for the
NNLL calculation one employs common scales of $\mu = M_h$. Both higher-order calculations take the most
recent parametrization of PDFs at next-to-next-to-leading order into account, where the corresponding
Table 1: Signal cross sections $\sigma_S$ at NNLO and LO plus the resulting signal K-factors for several different SM
Higgs boson masses. The Higgs boson widths are found from Hdecay calculations. Columns 5 and 6 show our
input NNLO cross sections taken from recent FEHiP calculations [53, 54, 55, 38] and input branching fractions for
$h \to W^*W^*$ obtained from Hdecay [34, 35], respectively. They are comparable to the values given in the 3rd and
4th columns that are used by the Tevatron experimentalists in their ongoing Higgs boson searches [4]. The signal
cross sections labelled "all" account for contributions stemming not only from gluon-gluon fusion but also from
Higgs strahlung and VBF Higgs boson production processes. All calculations use MSTW2008 parton distributions
except those to extract the LO values denoted by "66", which result from using CTEQ6.6 PDFs.



PDF sets have been provided by the MSTW group in 2008 [45]. The Tevatron Higgs boson searches use
these higher-order $gg \to h$ cross section predictions to report their combined CDF and D0 upper limits
on Standard Model Higgs boson production in the $W^+W^-$ decay mode [14, 15, 16, 4, 46, 17, 18, 19].
Hence, it makes sense to input the same theory cross sections in our studies to guarantee a reasonable level
of compatibility between our work and the experimental searches. However, one should keep in mind
that different viewpoints exist concerning the determination of the best $gg \to h$ cross section numbers.
For example, in Ref. [47] the authors argue that the 10-15% enhancement seen in the inclusive rates is
unlikely to survive the cuts applied in the Tevatron analyses and, therefore, should not be included in
the calculation of the limits [48, 49]. On the contrary, the renormalization-group improved resummed
NNLO cross sections discussed by Ahrens, Becher, Neubert and Yang in Refs. [50, 51, 52] would yield a
further 5-6% increase of the NNLL $gg \to h$ rates. This is because in their approach Ahrens et al. do not
only resum threshold logarithms from soft-gluon emission but also $\pi^2$-enhanced terms, which arise in the
analytic continuation of the gluon form factor to timelike momentum transfer.

For various Higgs boson masses, Table 1 summarizes the signal cross sections $\sigma_S$ that are of relevance
for our studies. Separated from the other entries, the left and right parts of the table show parameters
which we use as input to our analysis. We have used the NNLO $gg \to h$ inclusive cross sections, $\sigma^{\rm NNLO}_{gg\to h}$,
given in the 5th column and multiplied them with the branching ratios listed in the column to the right
of it, which we have computed with Hdecay version 3.51.^1 The Higgs boson widths shown in the 2nd
column are also taken from the Hdecay calculation; they slightly differ from the older values given in
ll2n . To arrive at the higher-order prediction of our signal cross sections depicted in the 8th column, 
$\sigma_S^{\rm NNLO}$, we furthermore have accounted for the W decays described by the branching ratios for one
light lepton species, $\mathcal{B}(W \to \ell\nu_\ell) = 0.108$, and for jets, $\mathcal{B}(W \to pp) = 0.676$, and a combinatorial
factor of 2 reflecting that either of the W bosons may decay leptonically. The NNLO cross sections used
here [55] are updated values with respect to the ones published in [38]. The difference can be traced back
to the addition of the electroweak real-radiation corrections as encoded in Ref. [39] and the change to a top
mass of $m_t = 173.1$ GeV. In the 3rd and 4th columns we also give, respectively, the $gg \to h$ cross
sections and $h \to W^*W^*$ branching fractions as used by the Tevatron experimentalists in their ongoing
searches [4]. These values are in good agreement with the respective numbers used in our study. The
other signal cross sections listed in the rightmost part of Table 1 are the LO rates obtained from Sherpa,
where the one labelled $\sigma^{(0)}_{S,66}$ refers to the use of CTEQ6.6 PDFs [56]. We will discuss these LO results
and the corresponding K-factors in Section 3.3.1. In the 7th column we show an upper estimate for the
signal cross sections, $\sigma_{S,\rm all}$, if one were to include the contributions from the $Wh$, $Zh$ and VBF production
channels. Over the considered Higgs boson mass range the extra processes would enhance the signal rates
resulting from gluon-gluon fusion by about 22-23%. We have determined these estimates by adding to
the $gg \to h$ rates the theory predictions for $\sigma_{Wh}$, $\sigma_{Zh}$ and $\sigma_{\rm VBF}$ as presented in the July 2010 CDF and
D0 Higgs boson searches combination paper [46] (for updated values, cf. [4]), including all necessary
branching fractions to arrive at the $\ell\nu_\ell\,pp$ final states.^2 The additional increase of roughly 20% resulting from
these considerations should be borne in mind when acceptances and significances are evaluated in a signal
versus background study. For the purpose of the analysis we are pursuing here, however, we want to be on
the conservative side and solely concentrate on the gluon-gluon fusion events.

3.2 Relevant background processes 

Processes that give rise to major background contributions are V+jets and multi-jet production, where the
latter comes into play because of jets faking isolated leptons and/or MET. The muon channel suffers less
from jet fakes, reducing the multi-jet background by a factor of 5 with respect to the electron channel. The
Z+jets background can also contribute in cases where one lepton goes missing, or a jet mimics a lepton
while the Z decays invisibly. Minor contributions stem from VV and $t\bar t$ production. A first D0 search in
the semileptonic Higgs boson decay channel shows how these backgrounds compare to each other after
basic selection cuts, see Ref. [26]. The largest fraction of 83% occurs from V+jets, followed by multi-jets,

^1 To obtain the values in Table 1, the NNLO MSTW fit result for the strong coupling, $\alpha_s(M_Z) = 0.11707$, has been employed,
the running of quark masses at NNLO has been enabled and the top and bottom quark masses have been set to $m_t = 173.1$ and
$m_b = 4.8$ GeV, respectively.

^2 For all production channels, we consider the Higgs boson decay as specified in Eq. (12), since our selection and reconstruction
procedures are tailored to this decay mode, see Section 4. In our analysis we deal with $\ell$, MET and multiple jets (from the
decays as well as I/FSR); thus, the presence of additional jets stemming from other hard decays or VBF does not alter our
analysis procedures considerably. In other words, the Higgs boson reconstruction and selection procedures are designed in a
robust way with respect to additional jet activity. The dominant Higgs strahlung contributions to the semileptonic final states
arise from the hadronic decays of the associated vector bosons, where we have used the PDG values $\mathcal{B}(W \to pp) = 0.676$ and
$\mathcal{B}(Z \to pp) = 0.6991$. All other combinations are suppressed by about an order of magnitude. Also, we do not consider more
than one lepton, i.e. we implicitly assume an exclusive one-lepton cut. We also neglect the $Vh$ cases where the Z boson decays
invisibly and the associated W boson splits up leptonically while $h \to pppp$. These modes will fail our $h$ boson reconstruction.
$t\bar t$ and VV contributing with 12%, 3% and 2% to the overall background. The V+jets contribution is
totally dominated by W+jets production; the $Z(\to \ell^+\ell^-)$+jets background, where one of the leptons is
missed, is small and makes up less than 1% of the total background.

To gain a better understanding of the backgrounds, we will have a closer look at the major contributor,
W+jets. The multi-jet background cannot be simulated straightforwardly, since it requires detailed knowledge
of the experiments, measured fake rates, etc. Regarding the minor background contributors, we
will study the $t\bar t$ as well as the WW (or, more exactly, electroweak) background. Even though they
enter at a rather low level after the basic selection compared to W+jets, it is necessary to cross-check what
number of events remain after more selective cuts have been applied, as will be discussed in Section 4.

Another background contribution that has been discussed is gluon-initiated vector boson pair produc- 
tion ll57l l58l l59l . This (quark-loop-induced) process occurs at 0{a^„a'^), the same order as the signal. 
This background formally arises at NNLO, but under realistic experimental cuts this production channel
has been shown to significantly increase e.g. the $WW \to 2\ell\,2\nu_\ell$ background at the LHC. At the Tevatron
the gluon densities are small, so the impact of $gg \to WW$ is expected to be negligible. This expectation
was confirmed in Ref. [60] (a 4‰ effect with respect to the NLO cross section for this decay channel). A
more important effect, also recently pointed out by Campbell et al. in Ref. [60], is the interference between
$gg \to WW$ and $gg \to h \to WW$, which can result in $\mathcal{O}(0.1)$ corrections to the Higgs boson signal cross
section. However, interference effects are considerably reduced by requiring the transverse mass of the
leptons plus MET system to be smaller than $M_h$. This type of transverse cut is frequently used in our
analyses, so we can safely neglect interference effects in our study.

3.2.1 W boson plus jets background 

For our first study of W+jets production, we explore the dependence of inclusive W+jet cross sections on
the number of jets and the variation of the common scale $\mu$ used to specify the factorization and renormalization
scales, $\mu_F$ and $\mu_R$, respectively. This information will help us identify an optimal definition of
the W+jets K-factor, which we take to improve the rates of the Sherpa predictions. We calculate inclusive
$W{+}n$-jet cross sections for $n \le 2$ with MCFM version 5.8 to obtain results that are accurate at NLO in
the strong-coupling constant [61, 62, 63]. We also run MCFM at LO to determine explicit NLO-to-LO
theoretical K-factors.^3

We display our MCFM results in Table 2 for different inclusive jet bins $n$ and scale choices $\mu$. As
expected, for each $n$-jet multiplicity, the NLO cross sections are more stable under scale variations, with
the largest deviations occurring for the more complex W+2-jet processes. This is also reflected by the
various NLO-to-LO K-factors, which vary from about 0.9 to 1.5 for $n = 2$ while they are rather constant
for $n = 0$, ranging from about 1.3 to 1.4 only. For illustrative purposes, we also list the LO and NLO
inclusive jet-rate ratios $R_{n/(n-1)} = \sigma_n/\sigma_{n-1}$ starting with $n = 1$. The $W + n \le 1$-jet cross
sections do not deviate substantially for the two nominal scales chosen, $\mu \sim M_{\perp,W}$, where
$M_{\perp,W}^2 = M_W^2 + p_{T,W}^2$, and $\mu \sim H_T$, which are determined dynamically for each event.
Note that $H_T$ is the scalar sum of the

^We only consider $W^+$ bosons decaying into $e^+\nu_e$ pairs; the charge-conjugated process will just double the
cross section owing to the $p\bar p$ initial states at the Tevatron. We employ the LO and NLO MSTW2008 PDFs [45] with
$\alpha_s(M_Z) = 0.13939$ and $\alpha_s(M_Z) = 0.12018$, respectively, and impose cuts according to the parameters
given in Section 4.1. Note that we do not account for the so-called triangle cut relating the transverse mass of the
W boson and the missing energy. Other parameters, such as the electroweak input values of the Standard Model, have
been taken according to the MCFM default settings.

Inclusive $W^+$+n-jet cross sections in pb.

  n   quantity      |  μ = M⊥,W/2    M⊥,W    2M⊥,W  |  μ = H_T/2    H_T     2H_T
  ------------------+-------------------------------+-----------------------------
  0   LO            |     457        465      469   |     453       463      468
  0   NLO           |     625        619      616   |     606       602      602
  0   K             |     1.37       1.33     1.31  |     1.34      1.30     1.29
  ------------------+-------------------------------+-----------------------------
  1   LO            |     66.2       55.6     47.3  |     62.5      52.7     45.1
  1   NLO           |     79.8       74.6     69.3  |     74.2      70.2     65.7
  1   K             |     1.21       1.34     1.47  |     1.19      1.33     1.46
  1   σ1/σ0 (LO)    |     0.145      0.120    0.101 |     0.138     0.114    0.096
  1   σ1/σ0 (NLO)   |     0.128      0.121    0.113 |     0.122     0.117    0.109
  ------------------+-------------------------------+-----------------------------
  2   LO            |     14.4       10.1     7.40  |     10.9      7.89     5.89
  2   NLO           |     12.8       11.7     10.4  |     12.0      10.1     8.95
  2   K             |     0.89       1.16     1.41  |     1.10      1.28     1.52
  2   σ2/σ1 (LO)    |     0.218      0.182    0.156 |     0.174     0.150    0.131
  2   σ2/σ1 (NLO)   |     0.160      0.157    0.150 |     0.162     0.144    0.136

[The "%" columns with the relative scale variations are not recoverable from the extracted text.]

Table 2: Inclusive $W^+$+n-jet cross sections $\sigma_n$ in pb at LO and NLO in QCD for different scale choices and jet
multiplicities using MSTW2008 PDFs. The variations with respect to the nominal choices, $M_{\perp,W}$ with
$M_{\perp,W}^2 = M_W^2 + p_{T,W}^2$ and $H_T$, are given in the columns labelled by "%". Numerical integration
uncertainties are not displayed, since they are at least one order of magnitude below the accuracy indicated here.
NLO-to-LO K-factors and n-to-(n-1)-jet cross section ratios are also shown for all possible instances.



transverse momenta of all particles (partons) in the event, i.e. no jet clustering has taken place. The
$\mu \sim M_{\perp,W}$ scales lead to slightly larger rates when compared to those obtained for $\mu \sim H_T$. This
can be traced back to the occurrence of $\mu$ values that are on average larger in the latter case, since
$\langle H_T \rangle > \langle M_{\perp,W} \rangle$ for $n \ge 1$. For the same reason, the cross section differences
become more manifest for $n = 2$. The presence of the second jet gives an extra $p_T$ contribution to $H_T$ per event,
whereas $M_{\perp,W}$ is less affected. This further enhances the deviation of the $H_T$ and $M_{\perp,W}$ averages.

Given the numbers of Table 2, we can conclude that our knowledge of the W+2-jet background is
accurate at the level of $\lesssim 20\%$. A K-factor of about 1.5 should be viewed as the upper limit for correcting
LO results; in Section 3.3.2 we will, however, compare the Sherpa background rates more closely with the
results of Table 2 and determine a K-factor accordingly.
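The K-factors and jet-rate ratios quoted in Table 2 follow directly from the listed cross sections. The short sketch below recomputes them for the $M_{\perp,W}$-type scales; the dictionary layout and the function names are illustrative assumptions and are not part of MCFM.

```python
# Cross sections in pb, transcribed from Table 2 for mu = M_perp,W/2, M_perp,W, 2*M_perp,W.
# The data layout and helper names are hypothetical, chosen only for this illustration.
sigma_lo = {0: [457.0, 465.0, 469.0], 1: [66.2, 55.6, 47.3], 2: [14.4, 10.1, 7.40]}
sigma_nlo = {0: [625.0, 619.0, 616.0], 1: [79.8, 74.6, 69.3], 2: [12.8, 11.7, 10.4]}


def k_factors(lo, nlo):
    """NLO-to-LO ratio for every jet multiplicity n and every scale choice."""
    return {n: [b / a for a, b in zip(lo[n], nlo[n])] for n in lo}


def jet_rate_ratios(sigma):
    """sigma_n / sigma_(n-1) for every n >= 1 and every scale choice."""
    return {
        n: [hi / lo for hi, lo in zip(sigma[n], sigma[n - 1])]
        for n in sigma
        if n - 1 in sigma
    }


K = k_factors(sigma_lo, sigma_nlo)
R_lo = jet_rate_ratios(sigma_lo)
R_nlo = jet_rate_ratios(sigma_nlo)

for n in sorted(K):
    print(f"n = {n}: K = {[round(k, 2) for k in K[n]]}")
```

The spread of the $n = 2$ K-factors across the three scales is what motivates treating 1.5 as an upper limit rather than a central value.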



3.3 Monte Carlo simulation of signal and backgrounds using Sherpa 

For reasons outlined at the beginning of Section 3, we use Sherpa version 1.1.3 [24, 25] to generate the
$e\nu_e$+jets signal and background events that are needed to understand the potential of a Standard Model
Higgs boson analysis in the lepton + MET + jets channel. We will employ the results of the previous
two subsections to settle the inclusive K-factors needed to re-scale Sherpa's LO predictions and include
higher-order rate effects.

^Version 1.1.3 was the last of the previous Sherpa generation; for all our purposes, it models the necessary physics
equally well compared to the upgraded versions of the current (1.3.x) generation. Cross-comparisons have confirmed
this result.

The signal and background simulations share a number of common parameters and options that have
been set as follows: we simulate all events at the parton-shower level, i.e. we include initial- and final-state
QCD radiation, but do not account for hadronization effects and corrections owing to the underlying event,
since their impact is considerably smaller than that of the additional QCD radiation arising from the hard
processes. The intrinsic transverse motion of quarks and gluons inside the colliding hadrons is, however,
modeled by an intrinsic Gaussian $k_T$-smearing with mean $\mu(k_T) = 0.2$ GeV and width $\sigma(k_T) = 0.8$ GeV.
The electroweak parameters are explicitly given: $M_W = 80.419$, $\Gamma_W = 2.06$, $M_Z = 91.188$,
$\Gamma_Z = 2.49$ GeV; the Higgs boson masses and widths are varied and taken according to Table 1; the couplings
are specified by $\alpha_{ew}(0) = 1/137.036$ and $\sin^2\theta_W = 0.2222$, and the Higgs field vacuum expectation
value and its quartic coupling are given as 246 GeV and 0.47591, respectively. The CKM matrix is simply taken to be
the identity matrix. The bottom and top quark masses are set to $m_b = 4.8$ and $m_t = 173.1$ GeV,
respectively, and all other quark masses are zero. To avoid any bias owing to the utilization of different
PDFs and in order to develop a consistent picture, signal and background events are generated using the
same parton distributions. Our first choice of PDFs is the LO MSTW set MSTW2008lo90cl [45], because
its NNLO version has been the preferred PDF set used for the recent calculations of the gluon-gluon
fusion Higgs boson production cross sections. The strong coupling is determined by one-loop running
with $\alpha_s(M_Z) = 0.13939$, which is the advertised fit value of the LO MSTW2008 set.

To gain some understanding of PDF effects, we compare our MSTW2008 results against predictions
generated with a different PDF set. To fully establish the comparison on the same level as for the
MSTW2008 PDFs, signal and background rates would have to be predicted from theory using the alternative
PDF libraries. We cannot follow this approach here; instead we start out from the same normalization that
has been used for the Sherpa predictions calculated with MSTW2008 PDFs. After the application of our
cuts we then focus on the differences induced by the alternative PDF set. As our second choice we employ
the CTEQ6.6 PDF libraries [56], where the strong coupling is set by $\alpha_s(M_Z) = 0.118$ and the running
of the coupling is again computed at one loop. Notice that Sherpa invokes a 6-flavour running for all
strong-coupling evaluations.
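For orientation, the one-loop running referred to above can be written down in a few lines. This is a minimal sketch of the textbook one-loop formula, not Sherpa's internal implementation; the function name and the explicit `n_f` argument are assumptions made for illustration.

```python
import math


def alpha_s_one_loop(mu, alpha_s_mz, n_f, m_z=91.188):
    """Textbook one-loop running of the strong coupling from M_Z to the scale mu (GeV).

    n_f is the number of active quark flavours and is left as an explicit
    argument here; it is an input of this sketch, not a statement about Sherpa.
    """
    b0 = (33.0 - 2.0 * n_f) / (12.0 * math.pi)
    return alpha_s_mz / (1.0 + alpha_s_mz * b0 * math.log(mu * mu / (m_z * m_z)))


# Compare the two reference values quoted in the text at an example scale.
for label, a_mz in (("MSTW2008 LO", 0.13939), ("CTEQ6.6", 0.118)):
    print(label, alpha_s_one_loop(165.0 / 2.0, a_mz, n_f=5))
```

The lower reference value used with CTEQ6.6 gives a smaller coupling at any given scale, which is one ingredient in the lower LO rates discussed for that PDF set.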



3.3.1 Generation of signal events 

We simulate signal events with electrons or positrons in the final state according to

$p\bar p \to h \to e\nu_e\,pp \to e\nu_e + \text{jets}$ . (13)

The hard process, composed as $gg \to h \to e\nu_e\,pp$, is calculated at LO. The incoming gluons and the
quarks arising from the decay undergo further parton showering, which is taken care of automatically by
the Sherpa simulation. One ends up with the $e\nu_e$ + jets final states generated at shower level. The
hard-process tree-level matrix elements and subsequent parton showers needed for the simulation are provided
by the Sherpa modules Amegic++ and Apacic++, respectively. For our purpose, it is sufficient to treat the
muon final states in exactly the same manner as the electron final states, i.e. the muon decay channel is
included by multiplication with the lepton factor $f_\ell = 2$ at the appropriate places.

The Higgs boson production occurs through gluon-gluon fusion via intermediate heavy-quark loops.
In Sherpa this is modeled at LO by an effective $gg \to h$ coupling where the top quarks have been
integrated out. The EHC (Effective Higgs Couplings) implementation of Sherpa includes all interactions
up to 5-point vertices that result from the effective-theory Lagrangian. These effective vertices can simply
be added to the Standard Model. We do not work in the infinite top-mass limit, because we also want to
consider Higgs bosons heavier than the top quark; that approximation is applicable only as
long as $m_t > M_h$. The Higgs boson decays are described by $1 \to 4$ processes, i.e. we directly consider
$h \to e\nu_e\,pp$. We thereby make use of Sherpa's feature to decompose processes at the amplitude level
into the production and decays of unstable intermediate particles while the colour and spin correlations
are fully preserved between the production and decay amplitudes [25]. This way one can focus on certain
resonant contributions instead of calculating the full set of diagrams contributing to a given final state,
which in our case would lead to the inclusion of contributions from the backgrounds. The intermediate
propagators are allowed to be off-shell, such that finite-width effects are naturally incorporated into the
simulation. This is especially useful for Higgs boson masses below the WW mass threshold, as the
$1 \to 4$ decays moreover guarantee the inclusion of off-shell W-boson effects. A consistent LO treatment
would require the use of total Higgs boson widths as computed at LO. We instead put in the values from
the Hdecay calculations [34, 35] as listed in Table 1. This modifies the Higgs boson propagators and one
arrives at a more accurate description of the finite-width effects of the Higgs boson decays. The effect on
the total rate,

$\sigma_S^{(0)} = [\ldots]$ , (14)

is nullified, since we eventually correct for the NNLO rates $\sigma^{\rm NNLO}$ worked out in Section 3.1.1.

In Ref. [35] a comparative study has been presented for Higgs boson production via gluon fusion at
the LHC. Amongst a variety of predictions, including those given by HNNLO [64, 65], the Sherpa
versions 1.1.3 and 1.2.1 have been validated to produce very reasonable results for the shapes of distributions
like the rapidity and transverse momentum of the Higgs boson, pseudo-rapidities and transverse momenta
of associated jets, and jet-jet $\Delta R$ separations. We hence rely on a well validated approach that works not
only for pure parton showering in addition to the Higgs boson production and decays, but also beyond that, in
the context of merging higher-order tree-level matrix elements with parton showers. Nevertheless, we have
carried out a number of cross-checks to convince ourselves of the correctness of the Sherpa calculations;
for the details, we refer the reader to Appendix A.1.

Finally we turn to the discussion of the K-factors. Recalling our findings of Section 3.1, we want to
re-scale Sherpa's leading-order signal cross sections $\sigma_S^{(0)}$ to the fixed-order NNLO predictions given by
FEHiP for Higgs boson production in $gg \to h$ fusion via intermediate heavy-quark loops.
To be consistent, the renormalization and factorization scales of the LO hard-process evaluations are
chosen as for the higher-order calculations, which employ $\mu = \mu_R = \mu_F = M_h/2$. The resulting cross
sections ultimately define our signal K-factors:

$K_S = \sigma_S^{\rm NNLO} / \sigma_S^{(0)}$ . (15)


We have determined two sets of K-factors for our two choices of PDFs, where the K-factors and LO cross
sections labelled by "66" refer to the case of utilizing the CTEQ6.6 libraries when calculating the LO
cross sections. Our results have already been summarized in Section 3.1.1; they are presented in the right
part of Table 1. The K-factors are remarkably stable, varying slowly from 2.8 to 2.6 over the entire Higgs
boson mass range when relying on MSTW2008 PDFs. In the CTEQ6.6 case, where we have employed
$\mu = \mu_R = \mu_F = \sqrt{\hat s}/2 \approx M_h/2$, they are larger due to the smaller LO rates, but their magnitude
still remains $\lesssim 3.6$.

In addition to the default scale choice of $\mu = M_h/2$ that we used for the MSTW runs, we have
explored other options by essentially varying this default setting for $\mu$ by factors of 2. We obtained results
for $\mu = M_h/4$, $\mu = \sqrt{\hat s}/2 \approx M_h/2$ and $\mu = \sqrt{\hat s} \approx M_h$, with the effect that the
LO rates were varied by +20% to -15% but, as expected, no shape changes were induced.

3.3.2 Generation of background events for W boson plus jets production 

We restrict ourselves to the Monte Carlo simulation of the $e\nu_e$ channels. Their final states are generated
through

$p\bar p \to e\nu_e + 0,1,2\;p \to e\nu_e + \text{jets}$ (16)

using an inclusive W+2-jets sample obtained from the Catani-Krauss-Kuhn-Webber (CKKW) merging
of the corresponding tree-level matrix elements with the parton showers (ME+PS) [66, 67]. In these
W+2-jets calculations the electroweak order is tied to $\alpha_{ew}^2$. Unlike in the NLO calculation, we do include
matrix elements where the extra partons may occur as b quarks; effectively, however, they are treated as
massless quarks in the evaluation of the matrix elements and generation of the radiation pattern. The events
are corrected for the b-quark mass after the parton showering. This approach generates slightly harder
$p_T$ spectra, which is acceptable because it makes our estimate of this background more conservative.
Similarly, we simply assume no effect of a b-jet veto in removing W+jets events.

The parameters of the matrix-element parton-shower merging are the jet separation scale $Q_{\rm jet}$ and
the D-parameter, which is used to fix the minimal separation of the parton jets. These parameters are
respectively set to $Q_{\rm jet} = 20$ GeV and $D = 0.4$ in correspondence to the jet $p_T$ threshold and cone
definitions of our analysis, see Section 4.1. $Q_{\rm jet}$ denotes the scale at which, according to the internal
$k_T$-jet measure incorporating the D-parameter, the multi-jet phase space is divided into the two domains of
$Q > Q_{\rm jet}$, where the jets are produced through exact tree-level matrix elements, and
$Q_{\rm jet} > Q > Q_{\rm cut-off} \sim 1$ GeV, where the parton-shower intra-jet evolution takes place. We generate
predictions from samples that merge matrix elements with up to $n_{\rm max} = 2$ partons. Although we could increase
this maximum number, at this point we do not want to include matrix elements with more than two partons in order to be
consistent with our signal event generation, where the jets beyond those arising from the W-boson decays
are produced by parton showers only. If one wishes to further improve on the description of additional
hard jets, both background and signal simulations should be extended on the same footing.

The V+jets predictions of Sherpa have been extensively studied and validated over the last few years.
Studies exist for comparisons against other Monte Carlo tools [68, 69, 70, 71, 72], NLO calculations
[?, 69, ?] and Tevatron Run I and II data [68, ?, 75, 25, 76, 77, 78, 79]. They have helped improve
Sherpa gradually and provided evidence that Sherpa gives a good description of the shapes of the V+jet



^The LO rates calculated with the CTEQ PDF libraries are diminished for two main reasons: the value of $\alpha_s$ at
$M_Z$ is considerably lower, and the altered scale choice entails a further reduction of the cross sections.

  order   n  |  μ = M⊥,W/2   M⊥,W   2M⊥,W  |  μ = H_T/2   H_T    2H_T  |  Sherpa CKKW [pb]
  -----------+-----------------------------+---------------------------+------------------
  LO      0  |     0.92      0.94    0.95  |     0.91     0.93   0.94  |       496
  LO      2  |     1.45      1.02    0.75  |     1.10     0.80   0.59  |       9.90
  NLO     0  |     1.26      1.25    1.24  |     1.22     1.21   1.21  |       496
  NLO     2  |     1.29      1.18    1.05  |     1.21     1.02   0.90  |       9.90

Table 3: Ratios at LO and QCD NLO taken between rates of MCFM and Sherpa CKKW (rightmost column) for
inclusive n-jet production at different choices of scales in MCFM using MSTW2008 PDFs in all cases. The
MCFM cross sections are listed in Table 2.



final-state distributions, missing a global scaling factor only, which can be extracted from the data [25] or
higher-order calculations [73]. In Appendix A.2 we briefly highlight to what extent the CKKW ME+PS
merging includes important features of NLO computations.

We use the results of Table 2 to identify a reasonable K-factor for our simulated W+jets backgrounds.
Relying on MSTW2008 PDFs, the Sherpa numbers for the inclusive $W^+$ and $W^+$+2-jet cross sections are
496 pb and 9.90 pb, respectively. The 0-jet Sherpa rate thereby is about 7% larger than the corresponding
LO rates given by MCFM. The differences occur because, on the one hand, MCFM by default invokes a
non-diagonal CKM matrix and a somewhat larger W-boson width; on the other hand, Sherpa's merged-sample
generation relies on a very different scale-setting procedure compared to the leading fixed-order
calculations. These differences have no effect on the kinematic distributions and are fully absorbed by
the K-factor, i.e. CKM effects may eventually enter through the correction of Sherpa's rate. Table 3
summarizes the ratios between the MCFM predictions of Table 2 and Sherpa's CKKW cross sections
mentioned above. This overview neatly points to the two options that give the most stable ratios; they are
found at NLO for $\mu = M_{\perp,W}/2$ and $\mu = H_T/2$, where the latter scale choice has been reported to be
well suited for even higher jet multiplicities [80, 73, 81]. Based on these observations, we can hence
conclude that it is fair to apply a K-factor of

$K_B = 1.25$ (17)



to the W+jets backgrounds employed in our study. The number found here compares well to global K- 
factors as reported throughout the literature. 

As outlined at the beginning of Section 3.3, we want to normalize the backgrounds obtained with
CTEQ6.6 to those computed with MSTW2008 PDFs. In the CTEQ case the Sherpa CKKW cross sections
amount to 544 pb and 8.13 pb for the inclusive $W^+$ and $W^+$+2-jet final states, respectively. Since the
latter selection of W+2-jet events is more exclusive, we re-scale the CTEQ backgrounds according to
$K_B^{66} \times 8.13$ pb $= K_B \times 9.90$ pb and arrive at

$K_B^{66} = 1.52$. (18)
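The numbers entering Eqs. (17) and (18) can be cross-checked with a few lines of arithmetic. The sketch below recomputes the NLO MCFM-to-Sherpa ratios for the two half-scale choices and the CTEQ6.6 re-scaling; all variable names are illustrative, and the input values are the ones quoted in the text and tables.

```python
# Sherpa CKKW cross sections in pb (MSTW2008), inclusive W and W+2 jets.
sherpa_mstw = {0: 496.0, 2: 9.90}

# NLO MCFM cross sections in pb from Table 2 for the two half-scale choices.
mcfm_nlo = {
    "M_perp,W/2": {0: 625.0, 2: 12.8},
    "H_T/2": {0: 606.0, 2: 12.0},
}

# MCFM / Sherpa ratios, i.e. candidate K-factors for the W+jets background.
ratios = {
    scale: {n: xsec[n] / sherpa_mstw[n] for n in xsec}
    for scale, xsec in mcfm_nlo.items()
}

K_B = 1.25                   # value adopted in Eq. (17)
sigma_2jet_cteq = 8.13       # Sherpa CKKW W+2-jet cross section with CTEQ6.6, in pb

# Require the K-factor-corrected W+2-jet rates to agree between the two PDF sets.
K_B_66 = K_B * sherpa_mstw[2] / sigma_2jet_cteq

print(ratios)
print(round(K_B_66, 2))
```

All four ratios lie between roughly 1.2 and 1.3, which is why a single global factor of 1.25 is a reasonable summary.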



^Switching to a unit CKM matrix and using Sherpa's input parameters, one finds 486 pb at $\mu = M_W$.

3.3.3 Generation of background events for electroweak and top-pair production

The WW background enters at $O(\alpha_{ew}^4)$ of the electroweak coupling constant $\alpha_{ew}$, i.e. it is suppressed by
more than two orders of magnitude with respect to the W+2-jets contribution occurring at $O(\alpha_{ew}^2 \alpha_s^2)$. Still,
without running the simulation we cannot say for sure whether the continuum WW production remains
a 1% effect after application of the analysis cuts and, if necessary, what handles exist to distinguish
it from the signal. Because of the large resemblance between the topologies of the Higgs boson decay
and the dominant WW production channels, we anticipate some of the cuts to be equally efficient for
both signal and minor background. This makes it hard to estimate a priori the extent to which the Higgs
boson signal will be diluted by the electroweak production type of processes. For the same reasons, the $t\bar t$
production final states can be expected to enhance the signal dilution on a similar level. Whether
we end up with a 1% or 10% effect, in this case it is sufficient to apply K-factors taken from the literature.

For the simulation of the diboson production background, we take the complete set of electroweak
diagrams occurring at $O(\alpha_{ew}^4)$ into account, including interference effects. This way we include physics
effects beyond the plain WW production with subsequent decays of the gauge bosons. As before we
only generate the processes regarding the first lepton family:

$p\bar p \to e\nu_e\,pp \to e\nu_e + \text{jets}$ (19)

where additional jets are produced by the parton shower. Similar setups have been validated for Sherpa
in [?] and more recently in [?]. Here, we employ a dynamic choice, $\mu = \sqrt{\hat s} \sim 2 M_W$, to calculate
the scales of the LO processes. Parton-level jets are generated as in Section 3.3.2 using the same jet-finder
algorithm and the same parameters ($Q_{\rm jet} = 20$ GeV and $D = 0.4$). Processes with bottom quarks are
included; just as in the W+2-jets case, they are treated as massless.

The $t\bar t$ background events are generated according to

$p\bar p \to t\bar t \to b\bar b\,e\nu_e\,pp \to e\nu_e + \text{jets}$ (20)

again utilizing the parton shower to describe any additional jet activity beyond that generated by the top
quark decays. We only consider the semileptonic channel. The fully hadronic channel has to be considered
together with the QCD background, and the fully leptonic channel will suffer from smaller branching
fractions, the single isolated-lepton requirement and any dijet mass window that we impose around the W
mass. The LO processes are calculated at the scale $\mu = m_t$, the mass of the b quarks is fully taken into
account and the partonic phase-space generation is subject to the same jet-finding constraints as used for
the compilation of the electroweak background. In addition we place mild generation cuts on the b quarks:
$p_{T,b} > 10$ GeV and $\Delta R_{b,p} > 0.3$.

We also examined the impact of Z+jets production on our analyses, and found that this contribution
makes up less than 1% of the total background. Since Z+jets has kinematics similar to W+jets, we will
not study it further.

In Sherpa the minor backgrounds are computed at LO. As in all other cases, we correct the total
inclusive cross section for NLO effects by multiplying with global K-factors, which for both electroweak
and $t\bar t$ production are larger than 1. Tevatron diboson searches like [85, 86] measure cross sections in good
agreement with the prediction given by Campbell and Ellis ($16.1 \pm 0.9$ pb for WW + WZ). From their
work [87] (Table III) we infer an NLO-to-LO K-factor ranging from 1.30 to 1.35. For our analysis, we
will then use the conservative estimate

^Relying on the full set of electroweak processes is more conservative: the rate increases by about 20%; the effect
on the shapes is rather small in general, although we observe slightly harder tails in $p_T$ distributions.



$K_{B,ew} = 1.35$. (21)

For the inclusive $t\bar t$ production, we can safely estimate a conservative K-factor of 1.30 by comparing the
cross section results given for the Tevatron in Ref. [89]. Adopting a b-tagging efficiency of the order of
50% would give us a 75% chance of vetoing $t\bar t$ events with at least one b-quark jet, i.e. we would be able to
remove about 3/4 of the $t\bar t$ background; again, we will be more conservative here and assume that about
40% of the $t\bar t$ events will pass; hence, for our purpose, we finally assign

$K_{B,t\bar t} = 1.30 \times 0.40 = 0.52$. (22)
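The veto arithmetic behind Eq. (22) is simple enough to spell out. The sketch below assumes two taggable b jets per $t\bar t$ event and independent tagging of each jet; both are simplifying assumptions of this illustration, and the function name is hypothetical.

```python
def veto_probability(tag_efficiency, n_b_jets=2):
    """Probability that at least one of n_b_jets independent b jets is tagged."""
    return 1.0 - (1.0 - tag_efficiency) ** n_b_jets


p_veto = veto_probability(0.50)   # idealized veto probability for 50% tagging efficiency
k_ttbar_nlo = 1.30                # conservative NLO-to-LO K-factor for ttbar
pass_fraction = 0.40              # conservative surviving fraction assumed in the text

# Effective factor applied to the LO ttbar rate, combining K-factor and b-jet veto.
k_ttbar_effective = k_ttbar_nlo * pass_fraction

print(p_veto, round(k_ttbar_effective, 2))
```

The idealized estimate would let only 25% of the events pass; using 40% instead is what makes the assigned factor conservative.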



4 Signal versus background studies based on Monte Carlo simulations using Sherpa

We report the successive improvements of the $S/\sqrt{B}$ significances when applying a series of cuts that
preserve most of the signal and reduce the inclusive W+2-jets background significantly.



4.1 Baseline selection 



We follow the event-selection procedure as used by the D0 collaboration [?]: hadronic jets $j$ are identified
by a seeded midpoint cone algorithm using the E-scheme for recombining the momenta [90]. The
cone size is taken as $R = 0.5$ and selection cuts of $p_T^{\rm jet} > 20$ GeV and $|\eta^{\rm jet}| < 2.5$ are imposed.
Additionally, we require a lepton-jet isolation of $\Delta R_{\rm lep-jet} > 0.4$. For the leptonic sector, we apply
transverse-momentum and pseudo-rapidity cuts of $p_T^{\rm lep} > 15$ GeV and $|\eta^{\rm lep}| < 1.1$, respectively,
supplemented by a missing-energy cut via $\not{E}_T > 15$ GeV. In addition, we also account for
$M_{T,W} + \not{E}_T/2 > 40$ GeV, which is known as the triangle cut.
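The baseline cuts listed above can be collected into a single predicate. The following is a minimal sketch under stated assumptions: jets are taken to be already clustered, events are represented by plain tuples, at least two accepted jets are required (needed for the later reconstruction but not stated as a baseline cut), the lepton must be isolated from every accepted jet, and the coefficient 0.5 in the triangle cut is an assumption of this illustration. All names are hypothetical.

```python
import math


def delta_r(eta1, phi1, eta2, phi2):
    """Angular separation, with the azimuthal difference wrapped into [-pi, pi]."""
    dphi = math.remainder(phi1 - phi2, 2.0 * math.pi)
    return math.hypot(eta1 - eta2, dphi)


def transverse_mass(lep_pt, lep_phi, met, met_phi):
    """Transverse mass of a massless lepton plus missing transverse energy."""
    return math.sqrt(max(0.0, 2.0 * lep_pt * met * (1.0 - math.cos(lep_phi - met_phi))))


def passes_baseline(lepton, met, jets, triangle_coeff=0.5):
    """lepton = (pt, eta, phi); met = (pt, phi); jets = list of (pt, eta, phi); GeV units."""
    lep_pt, lep_eta, lep_phi = lepton
    met_pt, met_phi = met
    if lep_pt <= 15.0 or abs(lep_eta) >= 1.1 or met_pt <= 15.0:
        return False
    good_jets = [j for j in jets if j[0] > 20.0 and abs(j[1]) < 2.5]
    if len(good_jets) < 2:  # assumption: two jets are needed for the W -> jj candidate
        return False
    if any(delta_r(lep_eta, lep_phi, j[1], j[2]) <= 0.4 for j in good_jets):
        return False
    m_t = transverse_mass(lep_pt, lep_phi, met_pt, met_phi)
    return m_t + triangle_coeff * met_pt > 40.0


example_jets = [(45.0, 0.5, 1.5), (30.0, -1.0, -2.0)]
print(passes_baseline((40.0, 0.2, 0.0), (35.0, 3.0), example_jets))
```

Keeping the triangle-cut coefficient as an argument makes it easy to check how sensitive the acceptance is to that particular choice.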



4.2 Higgs boson reconstruction based on invariant masses 

After the application of the basic cuts, we identify the best-fit $\{e, \nu_e, j, j'\}$ set from all possible candidates
allowed by combinatorics. The algorithm we use to identify the best-fit object is referred to as the Higgs
boson candidate selection. Several different selection algorithms are possible; for now, however, we will
use an invariant mass (or invm) selection: the four particles (reconstructed in a more or less ideal way)
whose combined mass $m_{e\nu_e jj'}$ is closest to a "test" Higgs boson mass are chosen. Of course, in the
context of the analysis, the Higgs boson mass enters as a hypothesis and, thus, is treated as a parameter.
Regardless of the selection algorithm, we refer to $j$ and $j'$ as the two selected jets, which are not necessarily
the hardest jets in the event.

^NLO corrections to VV production can become large; for a recent example, see [88], where K-factors as large as 1.77
have been reported. Taking this value, we would certainly overestimate the electroweak contribution, since the CDF-type
cuts employed in [88] are more exclusive. As for the shapes, we found them reliably described in a cross-check against
an electroweak WW+1-jet merged sample, including matrix-element contributions at $O(\alpha_{ew}^4 \alpha_s)$.

^The cut is applied to the leptonic W boson, where
$M_{T,W} = \sqrt{(|\vec p_{T,\ell}| + |\vec{\not p}_T|)^2 - (\vec p_{T,\ell} + \vec{\not p}_T)^2} = m_{T,\ell\nu}$, cf. Eq. (8). For
Higgs boson masses above the WW threshold, the rate reduction and shape changes induced by this cut are marginal.

After selection, we impose a requirement on the absolute difference between $m_{e\nu_e jj'}$ and the
hypothesized Higgs boson mass; events are kept only if they reconstruct a mass that lies within the window
$M_h - \Delta < m_{e\nu_e jj'} < M_h + \Delta$. This completes our combinatorial Higgs boson reconstruction, which we
label as "comb. h-reco" in our tables. On top of this selection, we may include an additional dijet mass
constraint of $M_W - \delta < m_{jj'} < M_W + \delta$ (marked by $|\tilde m_{jj'} - 80| < \tilde\delta$ in the tables). The
selection procedure will certainly shape, to some extent, the remaining background to look like the signal; however,
the primary effect we are interested in concerns the reduction of the background rate while we want to
preserve as many signal events as possible.

One may ask whether the reconstruction of the Higgs particle candidate can be achieved more easily by
selecting the set containing the respective hardest particles, in particular, by choosing the two hardest jets,
$j_1$ and $j_2$, to reconstruct the hadronically decaying W boson. We will refer to this approach as the naive
Higgs boson reconstruction, denoted as "naive h-reco" later on (as before we use $|\tilde m_{j_1 j_2} - 80| < \tilde\delta$ in the
tables to indicate that a dijet mass constraint has been imposed in addition). There are no combinatorial
issues in the naive scheme. However, as we show in our tables, it yields poorer significances than the
selection based on combinatorics.
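The two selection schemes described above can be contrasted in a small sketch. It assumes four-vectors stored as (E, px, py, pz) tuples in GeV, a fully reconstructed neutrino, and a combinatorial loop over jet pairs only; the function names and the default dijet centre of 80 GeV (following the table notation) are choices made for this illustration.

```python
import itertools
import math


def inv_mass(*vectors):
    """Invariant mass of the sum of four-vectors given as (E, px, py, pz)."""
    e, px, py, pz = (sum(component) for component in zip(*vectors))
    return math.sqrt(max(0.0, e * e - px * px - py * py - pz * pz))


def select_comb(lepton, neutrino, jets, m_test):
    """Jet pair whose four-body mass with the leptons is closest to the test mass."""
    return min(
        itertools.combinations(jets, 2),
        key=lambda pair: abs(inv_mass(lepton, neutrino, *pair) - m_test),
    )


def select_naive(jets):
    """The two jets with the largest transverse momentum."""
    ranked = sorted(jets, key=lambda p: math.hypot(p[1], p[2]), reverse=True)
    return ranked[0], ranked[1]


def in_mass_windows(lepton, neutrino, pair, m_test, big_delta, small_delta=None, m_w=80.0):
    """Apply the Higgs mass window and, optionally, the dijet mass window."""
    if abs(inv_mass(lepton, neutrino, *pair) - m_test) >= big_delta:
        return False
    return small_delta is None or abs(inv_mass(*pair) - m_w) < small_delta


# Toy event: one hard radiated jet plus two softer jets.
lep, nu = (50.0, 50.0, 0.0, 0.0), (50.0, -50.0, 0.0, 0.0)
ja, jb, jc = (60.0, 60.0, 0.0, 0.0), (30.0, 0.0, 30.0, 0.0), (29.0, 0.0, -29.0, 0.0)
print(select_comb(lep, nu, [ja, jb, jc], m_test=160.0) == (jb, jc))
print(select_naive([ja, jb, jc]) == (ja, jb))
```

In the toy event the hardest jet does not belong to the best-fit candidate, which is the situation in which the two schemes give different answers.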

We calculate the number of signal events $S$ and background events $B$ for different Higgs boson masses,
assuming a total integrated luminosity of $\mathcal{L} = 10$ fb$^{-1}$. This seems to be a good $\mathcal{L}$ estimate for
what each of the two Tevatron experiments, CDF and D0, were able to collect before the eventual Run II shutdown
in September 2011. We compute the numbers according to

$S = K_S\,\varepsilon_S\,\sigma_S^{(0)} \times 2 f_\ell \mathcal{L} = K_S\,\sigma_S \times 2 f_\ell \mathcal{L}$,

$B = K_B\,\varepsilon_B\,\sigma_B^{(0)} \times 2 f_\ell \mathcal{L} = K_B\,\sigma_B \times 2 f_\ell \mathcal{L}$ (23)

where $\varepsilon$ and $K$ respectively denote the total cut efficiencies and the K-factors, which we have worked out
in Section 3.1.1, cf. Table 1, and Section 3.3, cf. Eqs. (17), (21) and (22). The total efficiencies are
a product of single-step efficiencies, i.e. $\varepsilon = \prod_i \varepsilon_i$. The factor $f_\ell = 2$ accounts for including the
decay channels that involve muons and their associated neutrinos. Notice that the Higgs boson mass enters in our
simulation in two, potentially different ways. In practice, the Higgs boson mass that we used to generate
the signal need not be the same as the Higgs boson mass we use to formulate the analysis. We refer to the
former as the injected mass $M_h^{\rm inj}$ in the text, while we have already introduced the terminology of the latter
as the "test" or "hypothesis" Higgs boson mass $M_h$. However, for simplicity we take the generation-level
Higgs boson mass and the analysis-level Higgs boson mass to be equal, $M_h^{\rm inj} = M_h$. A discussion of how
different generation versus analysis masses would change our results can be found in Appendix B.1.
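Eq. (23) translates into a short helper for event yields and significances. In this sketch the plain factor of 2 is read as the combination of the two Tevatron experiments, following the caption of Table 4; the function names and the example numbers are purely illustrative and are not results of the analysis.

```python
import math


def expected_events(k_factor, efficiency, sigma_lo_fb, lumi_fb=10.0,
                    n_experiments=2, f_lepton=2):
    """Event yield following Eq. (23): K * eps * sigma_LO * 2 * f_l * L."""
    return k_factor * efficiency * sigma_lo_fb * n_experiments * f_lepton * lumi_fb


def total_efficiency(step_efficiencies):
    """Product of the single-step cut efficiencies."""
    return math.prod(step_efficiencies)


def significance(s, b):
    """Simple S / sqrt(B) figure of merit."""
    return s / math.sqrt(b)


# Hypothetical inputs, chosen only to exercise the functions.
eps = total_efficiency([0.5, 0.8])
s = expected_events(k_factor=2.5, efficiency=eps, sigma_lo_fb=10.0)
b = expected_events(k_factor=1.25, efficiency=0.01, sigma_lo_fb=2.0e5)
print(s, b, s / b, significance(s, b))
```

Because $S$ and $B$ carry the same factors of 2, $f_\ell$ and $\mathcal{L}$, the ratio S/B is independent of them, whereas $S/\sqrt{B}$ grows with the square root of their product.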

We can now go ahead and calculate the S/B ratios and $S/\sqrt{B}$ significances. For various Higgs boson
mass hypotheses, Table 4 and Tables 7-10 of Appendix B.1 list signal and W+jet-background cross
sections, acceptances, S/B ratios and significances at different levels of cuts for the selection procedures
discussed in this subsection. The Sherpa simulation runs obtained with the MSTW2008 LO PDFs have
been used to extract the results of all tables except those of Table 9, presented in Appendix B.1, which are
based on a set of runs taken with the CTEQ6.6 PDFs. In Appendix B.1 we will then also briefly discuss
the differences that can be seen between the predictions for the two PDF sets.

Table 4: Impact of the different levels of cuts on the $e\nu_e$+jets final states for the $gg \to h \to WW$ production and
decay signal and the W+jets background as obtained from Sherpa. Cross sections $\sigma_S$, $\sigma_B$, acceptances
$\varepsilon_S$, $\varepsilon_B$ and S/B, $S/\sqrt{B}$ ratios are shown for Higgs boson masses of $M_h = 165$ GeV, $M_h = 170$ GeV and
$M_h = 180$ GeV. Note that $\tilde m_{ij} = m_{ij}/{\rm GeV}$ and $\tilde\delta = \delta/{\rm GeV}$. Significances were calculated
using Eqs. (23) assuming $\mathcal{L} = 10$ fb$^{-1}$ of integrated luminosity, counting both electrons and muons and
combining Tevatron experiments.

We now turn to the discussion of the tables. Their setup is as follows: the rows represent different stages
in the cut-flow, Higgs boson reconstruction strategies, and mass window cuts, while the third through
fifth columns contain the outcomes for different Higgs boson masses. The second column indicates the
mass window cut (in GeV, referred to as $\Delta$ in the text), which has been applied to all reconstructed
Higgs boson candidates. Similarly, the number in square brackets next to each Higgs boson mass is
the dijet mass window cut (referred to as $\delta$, also in GeV). At every analysis level, six numbers are
displayed for each Higgs boson mass. The top row displays the LO signal cross section (in fb), the LO
W+jets cross section (in fb) and S/B at 10 fb$^{-1}$ of integrated luminosity, calculated including K-factors
and factors of 2 following Eqs. (23). The bottom three numbers in each table entry are the signal and
background efficiencies and $S/\sqrt{B}$. Of these entries, $S/\sqrt{B}$ is displayed in bold. For the first set of
tables, Tables 4 and 7 (see Appendix B.1), we concentrate on Higgs boson masses greater than about 162 GeV,
i.e. above the WW threshold. Higgs boson masses below the WW threshold pose additional challenges,
which we explore in a later subsection.

The rows are divided into three groups. In the first group, rows 1-4, the baseline selection cuts, as
described in Section 4.1, are applied. In the second group, rows 5-8, events are selected using the
"naive" criteria, then retained if their reconstructed mass falls within various Higgs boson and dijet mass
windows. Finally, in the last set of rows, 9-15, we select events with the "comb. h-reco" algorithm, then
apply several different mass windows. The effects of the mass window cuts, with either the "naive" or
"comb. h-reco" selection scheme, are fairly intuitive; mass windows always help because they emphasize
the peaks in the signal in comparison to a featureless W+jets background. Tighter mass windows are
usually, but not always, better. Clearly, among the three groups the combinatorial selections give the best
significances, followed by the naive ones, which already improve over the baseline selection cuts.

Comparing rows with identical cuts but different selections ("naive" versus "comb. h-reco"), such as
rows 5 and 9, or 7 and 11, the combinatorial Higgs boson reconstruction is better across all Higgs boson
masses by roughly 30%. The difference can be traced to events where one of the hardest jets comes
from I/FSR rather than from one of the jets of the W decay. Had we truncated our treatment of the
background at the matrix-element level (or even at matrix-element level plus some Gaussian smearing,
as in Ref. [7]), additional jet activity arising from I/FSR would be absent and the "comb. h-reco" scheme
would give the same result as the "naive h-reco" scheme. Incorporating these relevant I/FSR jets using
a complete, matrix-element plus parton-shower treatment of the background, we notice that the "naive"
scheme is no longer the best option. The ME+PS merging thereby allows us a fully inclusive description
of W+2-jet events on almost equal footing with the related NLO calculation, however with the advantage
of accounting for multiple parton emissions at leading-logarithmic accuracy. These effects are pivotal to
obtain reliable results for the combinatorial selections.
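
The difference between the two schemes can be illustrated with a small sketch. The precise "comb. h-reco" algorithm is defined elsewhere in the paper; the version below, which keeps the jet pair inside the dijet window whose 4-particle mass lies closest to the test mass, is an assumed simplification. All names and the toy four-vectors (E, px, py, pz, in GeV) are hypothetical.

```python
import itertools
import math

M_W = 80.4  # GeV, assumed nominal W boson mass


def inv_mass(*vectors):
    """Invariant mass of the summed four-vectors (E, px, py, pz)."""
    e = sum(v[0] for v in vectors)
    px = sum(v[1] for v in vectors)
    py = sum(v[2] for v in vectors)
    pz = sum(v[3] for v in vectors)
    return math.sqrt(max(e * e - px * px - py * py - pz * pz, 0.0))


def pt(v):
    """Transverse momentum of a four-vector."""
    return math.hypot(v[1], v[2])


def naive_pair(jets):
    """'Naive' choice: simply take the two hardest jets."""
    ordered = sorted(jets, key=pt, reverse=True)
    return ordered[0], ordered[1]


def combinatorial_pair(lepton, neutrino, jets, m_h, delta_jj=20.0):
    """Assumed combinatorial choice: among all jet pairs inside the dijet
    window, keep the pair whose 4-particle mass is closest to m_h."""
    best, best_dist = None, float("inf")
    for j1, j2 in itertools.combinations(jets, 2):
        if abs(inv_mass(j1, j2) - M_W) >= delta_jj:
            continue
        dist = abs(inv_mass(lepton, neutrino, j1, j2) - m_h)
        if dist < best_dist:
            best, best_dist = (j1, j2), dist
    return best


# Toy event: the hardest jet mimics I/FSR, the two softer jets come from the W.
lepton = (40.0, 0.0, 0.0, 40.0)
neutrino = (40.0, 0.0, 0.0, -40.0)
jet_isr = (60.0, 60.0, 0.0, 0.0)
jet_a = (40.0, 0.0, 40.0, 0.0)
jet_b = (40.0, 0.0, -40.0, 0.0)
jets = [jet_isr, jet_a, jet_b]
```

For this toy event the naive choice includes the radiation jet, while the combinatorial choice recovers the two W-decay jets for a test mass of 160 GeV.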

Showering effects are not just limited to the background. In particular, the width of the Higgs boson
candidates reconstructed from showered events is much broader than the reconstructed width derived from
parton level. In fact, after showering, the reconstructed Higgs boson peak is typically so broad that the
tightest mass windows used in the tables (Δ = 5 and 8 GeV) cut out some of the signal and yield worse
significances than broader windows. For example, the combinatorial selections supplemented by a dijet
mass window yield a FWHM of about 10 GeV at the shower level, while the FWHM at the parton level
is reduced to 2 GeV, which basically is the width of one bin. If we relied on the matrix-element

Note that the "lepton & MET cuts" level also includes a lepton-jet separation of ΔR > 0.4 in the presence of jets.
Significance vs. Higgs boson mass away from the charged diboson threshold.
Figure 2: S/√B significances for Higgs boson masses varying from M_h = 110 to 220 GeV after different levels
of cuts. The numbers are taken from Tables 4 and 7-10, which reflect in more detail the outcome of the analysis
based on the invm selection procedure for eν_e+jets final states originating from the gg → h → WW signal and
the W+jets background. Results are shown for Higgs boson masses below and above the WW mass threshold;
the threshold region has been left out though. All significances were calculated according to Eqs. (23) under the
assumption of an integrated luminosity of L = 10 fb^-1 and including electron and muon channels, i.e. a factor of 2.



level results, we would obtain far too promising S/√B. Focusing on the M_h = 180 GeV test point and
the "comb. h-reco" with a dijet mass window, we would find the significances increasing from 1.5 for
Δ = 25 GeV, through 2.2 for Δ = 10 GeV, to 3.0 for Δ = 5 GeV. These numbers should be contrasted with
those in Table 4, namely 1.12, 1.19 and 1.02, respectively.
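
The interplay between peak width and window size can be illustrated with a toy model. The sketch below assumes a Gaussian signal peak on a flat background and treats Δ as the half-width of the window; neither assumption is taken from the analysis itself, and the function names are hypothetical. Because the showered peak has non-Gaussian tails, the toy is not expected to reproduce the table values; it only shows the qualitative trend.

```python
import math

# Conversion between the FWHM and the standard deviation of a Gaussian.
FWHM_TO_SIGMA = 1.0 / (2.0 * math.sqrt(2.0 * math.log(2.0)))


def relative_significance(half_width, fwhm):
    """Toy S/sqrt(B), up to a constant, for a window M_h +- half_width.

    Signal: Gaussian peak of the given FWHM centred in the window.
    Background: flat, so its yield is proportional to the window size.
    """
    sigma = fwhm * FWHM_TO_SIGMA
    signal_fraction = math.erf(half_width / (math.sqrt(2.0) * sigma))
    background = 2.0 * half_width
    return signal_fraction / math.sqrt(background)


# Gain from tightening the window from 25 GeV to 5 GeV.
gain_parton = relative_significance(5.0, 2.0) / relative_significance(25.0, 2.0)
gain_shower = relative_significance(5.0, 10.0) / relative_significance(25.0, 10.0)
```

For the narrow (parton-level-like) peak the gain approaches √5, as expected when the window only removes background, whereas for the broad (shower-level-like) peak the gain is smaller and very tight windows start to lose significance.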

To conclude this discussion, it is illustrative to show a plot of the significances versus Higgs boson
masses for various selections as presented in the tables (Table 4 and Tables 7-10 in Appendix B.1), all
of which is summarized in Figure 2. The significance, at least after this level of analysis, reaches a
maximum of ~2.0. The highest significance occurs, as expected, near the WW threshold. For heavier
Higgs bosons, as the WW decay mode becomes subdominant to the ZZ mode, the significance drops
slowly, reaching ~1.0 at ~185 GeV. By gradually enhancing the selections the gain in significance
remains approximately equal over the whole region of large M_h; this is indicated by the parallel shifts
of the respective significance curves. Hence, the differences seen in the significances per M_h test point
are mainly driven by the behaviour of the total inclusive cross section for the signal. Looking back at
Tables 4 and 7, we in fact realize that the acceptances ε_S and ε_B are rather similar at any selection step
(for each row), only mildly varying across the different Higgs boson test mass points. For Higgs boson
masses below threshold (Table 8 in Appendix B.1), as we will discuss in the next sections, the drop-off
is more severe. Not only does the branching fraction to WW* fall rapidly, but the signal becomes more
background-like once the two W bosons from the Higgs boson decay cannot both be produced on-shell.
The significances shown in Figure 2 reflect our best estimates; however, we have also performed several
checks on the stability of these significances under slight variations in the analysis. These checks
include not only - as mentioned earlier - varying parton distribution functions, but also varying jet
definitions, etc., and are summarized in Appendix B.1.

4.2.1 Reconstruction below the on-shell diboson mass threshold 

In the above-threshold case there is good hope that the idealized approach of considering the neutrino
as a fully measurable particle will not lead to results that are sizably different from those obtained by
a realistic treatment of neutrinos. This is based on the fact that in most cases the leptonic W will be
on its mass shell. The approximation m_{eν_e} ≈ M_W can in principle be used to determine the neutrino's
longitudinal momentum - up to a twofold ambiguity - by employing the lepton and missing transverse
energy measurements. Below the WW mass threshold one of the W bosons will be off-shell, so that the
simple ansatz in calculating p_{∥,ν_e} will be rather inaccurate. Hence, it is a priori not clear whether an event
selection based on invariant-mass windows will give an overall picture that can be maintained in more
realistic scenarios. Nevertheless here we briefly establish what kind of significances may be achievable
assuming we had knowledge about the off-shellness (the actual mass) of the leptonic W boson. This will
give us a benchmark, which we may use to assess more realistic reconstruction approaches.

When we apply the same analysis as above the WW threshold, we find significances as summarized
in Table 8 of Appendix B.1. They are visualized in Figure 2. The numbers demonstrate that we quickly
lose sensitivity below the WW threshold, in particular for test points M_h < 130 GeV. This happens for
three reasons (which apply to the signal only): one factor is the decline of the total inclusive signal cross
section σ_NNLO towards lower M_h, which actually is comparable to that seen for large M_h. As shown
in Table 1, this effect is not as drastic as one would assume from the drop in the h → W*W* branching
ratios; it is partly compensated by the rising gluon-gluon fusion rate for low M_h. In contrast to the
above-threshold case, there are two more factors coming into the equation. Firstly, the basic selection cuts
affect the signal more severely; secondly, the low-M_h signals that pass the baseline selection are often
penalized because of substantial off-shell effects. In particular, the Higgs boson propagator can be pushed
far off-shell and the Higgs boson reconstruction will fall outside the mass window, such that the event will
be discarded. The tendency for lighter Higgs bosons to go off-shell increases, since the basic cuts make it
extremely unlikely for the leptonic and hadronic W masses to drop below ~30 GeV.

Figure 3 shows the m_{eν_e jj'} spectra including shower effects for signals and backgrounds at M_h = 130
and 180 GeV after the combinatorial Higgs boson reconstruction has been applied using wide Higgs boson
mass windows (Δ = M_h). The parton showering washes out the peaks, i.e. it lowers and broadens
them. Both signal distributions develop a softer tail above M_h as a result of the jet combinatorics. For



"For Mh ~ 110 GeV, only about 7% of the events survive, while A5-^9% of the signal is kept above threshold (cf. the 
respective 1st rows in Table|8]and 3rd rows in Tables|4|and|7). 
Combinatorial selection using wide Higgs boson mass windows.
Figure 3: Mass spectra m_{eν_e jj'} after the combinatorial reconstruction of Higgs boson candidates for very wide
Higgs boson mass windows. Results are shown for M_h = 130 GeV and M_h = 180 GeV and eν_e+jets final states
originating from the gg → h → WW signal (peaked distributions) and the W+jets background (flat distributions).

M_h = 130 GeV, the tail plateaus due to the off-shell effects mentioned earlier. Figure 3 also illustrates
why the value of the significance jumps up markedly (as shown in Figure 2) when we tighten the Higgs
boson mass window from Δ = 25 to 10 GeV for M_h = 130 GeV. This effect arises because we place
our window cuts in a steeply rising W+jets background.

When we studied which choice of mass window gives us the best results in terms of separating signal
from background, it came as somewhat of a surprise that we did not have to alter the additional dijet
mass constraint of M_W - δ < m_{jj'} < M_W + δ, δ = 20 GeV. Our studies indicate that it is helpful to
have the hadronically decaying W boson close to its on-shell mass M_W. The W boson decaying
leptonically is then forced to go off-shell (m_{eν_e} < M_W), a kinematic configuration at odds with most
W+jets events. Cutting on m_{jj'} therefore helps suppress the dominant background and, moreover, should
also help to suppress multi-jet production efficiently.

For the tighter Higgs boson mass windows, our results show that a simple one-sided lower cut on m_{jj'},
i.e. m_{jj'} > M_W - δ, is slightly more efficient than using any type of dijet mass window. The one-sided
cut improves the significances as given in Table 8 by 1-2%. The removal of the upper bound on m_{jj'}
has however negligible effects on selections using broad Higgs boson mass windows. As a consequence
of keeping an m_{jj'} constraint the leptonically decaying W will almost always be off-shell, such that the
Figure 4: Cut efficiencies ε_B and ε_S for eν_e+jets final states and different invm naive and combinatorial Higgs
boson reconstructions taking the mass points M_h = 130, 170 and 210 GeV. The selections are labelled by the row
numbers as assigned in Tables 4 and 7-10; row 3 marks the baseline selection (used as benchmark). The left pane
exhibits the efficiencies found for the minor backgrounds - electroweak and top-antitop pair production (dashed and
solid lines, respectively) - whereas the right pane displays the ε_B for the W+jets background (dashed lines) as well
as the ε_S of the gg → h → WW signal (solid lines). The two plots to the right compared with each other nicely
visualize why the combinatorial outperforms the naive selection: the signal cut efficiencies get increased, while, for
W+jets, the cuts remain about as effective as for the naive approach. Also notice the drop of the M_h = 130 GeV
signal curves - they show the penalty in employing the same basic cuts as above the WW threshold.



reconstruction of the longitudinal component of the neutrino's four-momentum cannot succeed without a
good guess of the mass of the eν_e pair. We will address this issue in Section 4.3.



4.2.2 Effect of the subdominant backgrounds 

In this section, we examine to what extent the significances of the ideal Higgs boson reconstruction will
be diluted by contributions from the electroweak and top-pair production of the eν_e+jets final states. To
this end we apply the analysis as established so far, without any modification.

The first thing to notice is that the total inclusive LO cross sections for these minor backgrounds are O(1) pb
- substantially smaller than the W production contribution. After the application of the basic cuts, the
inclusive eν_e+1-jet cross sections drop to about 0.5 pb, a factor of 40 below the major background. Including
all the various K-factors, see Table 1 and Eqs. (11), (21), (22), we find that the total significance
at the basic selection level is only 2% smaller compared to the significance S/√B using only W+jets.
The cross sections stated are LO-like cross sections as obtained with Sherpa: before (after) the basic cuts, we find 1.21 and
Significance vs. Higgs boson mass including the minor backgrounds.
Figure 5: Single-background and total significances as a function of M_h for three different Higgs boson candidate
invm selections as denoted on top of each panel. The eν_e+jets final states are generated from the gg → h → WW
signal, W+jets, electroweak and tt̄ production backgrounds. All S/√B were calculated according to Eqs. (23)
assuming an integrated luminosity of L = 10 fb^-1 and including electron and muon channels, i.e. a factor of 2. The
total-background significances were obtained with Eq. (24). The lower plots hold the ratios of S/√B_tot over
S/√B for W+jets only. Note that the large tt̄ significances obtained by the naive selection (left panel) result from
the failure of the relatively harder leading-jet pairs to satisfy the mass window constraints.

Switching from the baseline selection to a combinatorial selection and finite dijet mass window, the
single minor-background significances improve by up to 50% above (100% below) the WW threshold.
So, for beyond-baseline h reconstructions, the significance corrections owing to the inclusion of the minor
backgrounds will be of the same order as before. This is documented in both Figures 4 and 5. In the
former we compare the cut efficiencies between all backgrounds and the signal (shown together with the
W+jets background in the plots to the right in Figure 4) for various naive and combinatorial Higgs boson
candidate selections. Firstly, no background cut efficiency ever exceeds any of the ε_S. In all cases the
background curves decrease more strongly when tightening the selection. Secondly, the pattern we observe



0.886 pb (540 and 508 fb) for the electroweak and tt̄ backgrounds, respectively; the resulting single-background significances
turn out to be almost 6 and 10 times larger than the W+jets S/√B.
for the minor-background cut efficiency curves resembles by and large that of the major background.
For these reasons, the single-background significances plotted versus M_h follow the trend found for the
W+jets contribution, but remain well above the W+jets significances. All of this is exemplified in
Figure 5 using three Higgs boson candidate selections, which impose a dijet mass window (corresponding
to the rows 8, 11 and 13 in Tables 4, 7 and 10), namely the naive method with Δ = 10 GeV (left panel)
and the combinatorial method with the same and broader window of Δ = 25 GeV (middle panel). The
total significances S/√B_tot resulting from combining the three single backgrounds are also shown. In
fact, they only decrease by 1-5% as demonstrated by the ratio plots in the lower part of Figure 5. The high
M_h region is found to receive the larger, O(5%) corrections once the electroweak and tt̄ contributions
are included in the overall background. As noted early in Section 3, experimenters have estimated this
fraction of events at 5%, implying a 2.5% drop in significance. It is reassuring to be able to confirm
this expectation with our results. Slightly contrary to the expectation, we identify the electroweak as the
leading minor background in all selections.
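
The step from a 5% background fraction to a roughly 2.5% loss in significance follows from the square-root dependence on the background. A short check, assuming that the significance is S/√B and that the extra backgrounds simply rescale B to B(1 + f) (the function name is hypothetical):

```python
import math


def significance_ratio(extra_background_fraction):
    """Ratio of S/sqrt(B_tot) to S/sqrt(B) for B_tot = B * (1 + f)."""
    return 1.0 / math.sqrt(1.0 + extra_background_fraction)


# Relative drop in significance for an extra 5% of background events.
drop = 1.0 - significance_ratio(0.05)
```

To first order the drop is f/2, which is why a 5% addition to the background costs only about half that amount in significance.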

Based on these results it is easy to conclude: at this stage of our analysis we do not have to worry
about contributions from minor backgrounds. Although additional handles exist to further reduce these
backgrounds or supplement the (here conservatively chosen) 6-jet veto, it is of far more importance to find
ways to diminish the W+jets background. We postpone this discussion until Section 4.4.

4.3 More realistic Higgs boson reconstruction methods 

Up to this point we have ignored one big problem, namely the neutrino problem. In our selection based on
the reconstruction of invariant masses - which we dubbed the invm approach - we currently treat neutrinos
as if we were able to measure them like leptons. This is, of course, unrealistic and before we can talk
about further significance improvements, we have to investigate in which way our analysis may fall short
when switching to more experimentally motivated Higgs boson candidate selections. Under experimental
conditions, missing energy is taken from the p_T imbalance in the event. However, in our analysis we
make a small simplification and identify the missing energy with the neutrino's transverse momentum as
given by the Monte Carlo simulation.

There are multiple choices for how to proceed. Recall that whatever method we pick acts as a selection
criterion; we decide which two jets to keep in the event based on these variables, therefore we want to
design variables that are best at correctly picking out the jets from a Higgs boson decay. One way
to proceed is to give up complete reconstruction and to work solely with transverse quantities; this is
clean and unambiguous, but we throw out information. The second approach is to attempt to guess the
longitudinal neutrino momentum by requiring that some or all of the final-state objects reconstruct an
object we expect, such as a W or Higgs boson. Full reconstruction then gives access to a larger set of
observables, therefore keeps more handles and information, but it is also more ambiguous.

In particular, the minor-background cut efficiencies show more pronounced drops if one enhances the baseline to a Higgs
boson candidate selection, introduces the dijet mass window δ or tightens the Δ Higgs boson mass range. Notice that the naive
Higgs boson reconstruction very efficiently beats down the tt̄ background. This is because the two leading jets turn out harder
compared to all other cases. The presence of a sufficient number of subleading jets however makes the selection based on jet
combinatorics pick a pair of soft jets, and, on the contrary almost too effective for Higgs boson masses above the WW threshold.

Both the electroweak and tt̄ production significances show the same strong enhancement around M_h = 130 GeV as a result
of the effect discussed around Figure 3, which is due to the use of a tight Higgs boson mass window. As for W+jets, the minor
backgrounds fall rapidly for decreasing M_h.
[Table 5: columns M_h/M_W, pzmw, pzmh, mt, mtp; rows M_h/M_W < 2 and M_h/M_W > 2; cell entries lost in extraction.]

Table 5: The preferred choice of definition for the 4-particle transverse mass shown for each of the more realistic
Higgs boson candidate selections. The m_T definitions are given in Eqs. (8) and (9).

To remove some of the combinatorial headache, we use eν_e jj' and jj' (transverse) mass windows as
before; moreover, we can impose criteria on subsets of the event. For example, if the (3-particle) mass of
the visible system m_{ejj'} is greater than the test Higgs boson mass, that particular choice of jets is
unphysical and we can move on to the next choice. A second constraint we often impose is that the 4-particle
transverse mass does not exceed the upper bound on the Higgs boson mass window: m_{T,eν_e jj'} ≤ M_h + Δ.
As to the definition of m_T, we generally use the definitions stated in Section 2, see Eqs. (8) and (9). For
our selections, we found that the distinction between the two m_T definitions in fact only matters when we
calculate the 4-particle transverse masses. Accordingly, each selection comes in two versions, one for each
definition of the 4-particle transverse mass. In Table 5 we summarize for each type which version is more
appropriate to use and in what context. Whenever we refer to a specific selection in due course, we
understand it according to the findings listed in Table 5.

With these criteria in hand, the different selection methods are specified as follows: 

• mt: we want to test a method where the final state of the Higgs boson decay will be identified
purely with the help of transverse masses rather than invariant masses. To this end we calculate
m_{T,eν_e jj'} according to Eqs. (8) or (9) and prefer the final state giving us the 4-particle transverse
mass closest to M_h. Owing to m_{T,eν_e jj'} ≤ m_{eν_e jj'} the test mass window is placed on the lower side
only, M_h - 2Δ < m_{T,eν_e jj'} < M_h, with double the size as compared to the other selections. The
advantage, but also disadvantage, of the method is that there is no reconstruction. Avoiding
reconstruction eliminates uncertainties owing to constraining masses plus resolving ambiguities, but means
we have no access to longitudinal and invariant-mass observables involving the neutrino.

For the next two selections, we aim to approximately determine the longitudinal momentum of the
neutrino, p_{∥,ν_e}, using knowledge about which value for m_{eν_e} should likely be reconstructed by the combined
system, p_e + p_{ν_e}. When we write

m^2_{ik\ldots\nu_e} = m^2_{ik\ldots} + 2 \left( \sqrt{m^2_{\perp,ik\ldots} + p^2_{\parallel,ik\ldots}} \, \sqrt{p^2_{T,\nu_e} + p^2_{\parallel,\nu_e}} - \vec{p}_{T,ik\ldots} \cdot \vec{p}_{T,\nu_e} - p_{\parallel,ik\ldots} \, p_{\parallel,\nu_e} \right) , \qquad (25)

using the "(in)visible" subsystem notation, we note that such problems can be solved up to a twofold
ambiguity. The difference among the two selections lies in how particles are grouped in Eq. (25) and how
the twofold ambiguity is resolved.
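
A mass constraint of this kind reduces to a quadratic equation for the neutrino's longitudinal momentum. The following is a minimal sketch of that step, assuming a massless neutrino whose transverse momentum equals the missing transverse momentum, a visible subsystem given as a four-vector (E, px, py, pz), and the real-part prescription for complex solutions. The function name and its interface are hypothetical.

```python
import math


def neutrino_pz(visible, nu_px, nu_py, target_mass):
    """Solve m(visible + neutrino) = target_mass for the neutrino p_z.

    Returns two solutions if the discriminant is non-negative, otherwise
    the single real part of the complex pair.
    """
    e, px, py, pz = visible
    m_vis2 = e * e - px * px - py * py - pz * pz
    m_perp2 = e * e - pz * pz  # m^2 + pT^2 of the visible subsystem
    # a = (M^2 - m_vis^2) / 2 + pT_vis . pT_nu
    a = 0.5 * (target_mass ** 2 - m_vis2) + px * nu_px + py * nu_py
    pt_nu2 = nu_px ** 2 + nu_py ** 2
    disc = a * a - m_perp2 * pt_nu2
    if disc < 0.0:
        return [a * pz / m_perp2]  # real part only, no ambiguity left
    root = e * math.sqrt(disc)
    return [(a * pz - root) / m_perp2, (a * pz + root) / m_perp2]


# Toy check: a massless lepton and a neutrino with known p_z = 40 GeV.
lepton = (math.sqrt(30.0 ** 2 + 20.0 ** 2), 30.0, 0.0, 20.0)
nu_e = math.sqrt(25.0 ** 2 + 10.0 ** 2 + 40.0 ** 2)
true_mass = math.sqrt(2.0 * (lepton[0] * nu_e - (30.0 * -25.0 + 20.0 * 40.0)))
solutions = neutrino_pz(lepton, -25.0, 10.0, true_mass)
```

When the true pair mass is supplied as the target, one of the two solutions reproduces the generated p_z; choosing between the two is what distinguishes the selections described next.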

Provided the MET cut was passed, we assume here that all MET in the event has been produced by a single neutrino.
If the solutions are complex-valued, we only assign the real part to describe p_{∥,ν_e} with no ambiguity left to resolve.
• pzmw: in this selection we use the W mass constraint to solve for the neutrino momentum:
m_{eν_e} = M_W. The ambiguity is then resolved by picking the neutrino p_z solution which brings the
reconstructed mass m_{eν_e jj'} closer to the Higgs boson test mass M_h. For true signal events,
it then is more likely to find the reconstructed m_{eν_e jj'} matching the Higgs boson mass.

The tricky part is to pick the best choice for the m*_{eν_e} constraint - meaning, is M_W always
optimal given that the W boson may be off-shell? For pzmw, we do the following: first, we inspect the
transverse mass, m_{T,eν_e}, of the eν_e subsystem in each event. If m_{T,eν_e} ≥ M_W we choose
m*_{eν_e} = m_{T,eν_e}; otherwise we pick m*_{eν_e} = M_W as long as M_h > 2 M_W or 0.9 < m_{T,eν_e}/M_W < 1.0.
That is, above and around the WW threshold, we take m_{eν_e} towards M_W. If below threshold
and m_{T,eν_e}/M_W < 0.9, the mass estimate is chosen taking various subsystem invariant and
transverse masses into account but enforcing m*_{eν_e} to lie between m_{T,eν_e} and M_W. For example, if
m_{jj'} > 2 m_{T,jj'} we set m*_{eν_e} = m_{T,eν_e jj'} - m_{jj'}, while otherwise m*_{eν_e} = m_{T,eν_e jj'} - m_{T,jj'}
unless m_{ejj'} > m_{T,eν_e jj'}, where we say m*_{eν_e} = m_{T,eν_e}.

• pzmh: we again infer the neutrino's longitudinal momentum from mass constraints. Although
technically similar to pzmw - with the "visible" subsystem entering Eq. (25) now being {e, j, j'}
- we here turn the idea around and already require m_{eν_e jj'} ≈ M_h in order to solve for p_{∥,ν_e}. That
is to say we enforce the combined system, p_{ejj'} + p_{ν_e}, to mimic a Higgs boson signal mass while
leaving us with reasonable leptonic W masses m_{eν_e} at the same time. When reconstructing the
signal these observables are likely correlated, while for the background they are uncorrelated apart
from kinematic constraints.

The details of the method are: we specify the target mass via m*_{eν_e jj'} = M_h unless we
find m_{T,eν_e jj'}/M_h > 0.94, i.e. the 4-particle transverse mass turns out too large already so that
m*_{eν_e jj'} = m_{T,eν_e jj'}/0.94 is the more appropriate choice. We approximate the leptonic W boson
mass by m*_{eν_e} = M_h - m_{jj'}, freezing it at m*_{eν_e} = M_W if this difference exceeds M_W.
We however require min(m*_{eν_e}) = m_{T,eν_e}. Taking this estimate, we can form the absolute
difference δm_{eν_e} = |m_{eν_e} - m*_{eν_e}| using the reconstructed mass m_{eν_e} for each possible neutrino
solution. In the presence of two solutions we define, as a measure of the longitudinal activity,
b_{ij} = m_{⊥,ij} exp|y_{ij}| = max{E_{ij} ± p_{∥,ij}} with m²_{⊥,ij} = m²_{ij} + p²_{T,ij} and pick the solution that
generates the smaller b_{eν_e}, i.e. the eν_e subsystem less likely going forward. We do so unless the other
solution's δm'_{eν_e} drops below δm_{eν_e} and
(b_{jj'} + b'_{eν_e}) / (m_{jj'} + m'_{eν_e}) < (b_{jj'} + b_{eν_e}) / (m_{jj'} + m_{eν_e}) + δ_x
is satisfied; this is when we pick conversely (δ_x = 0.5 if M_h < 2 M_W, otherwise δ_x = 1.0). Finally,
we ensure that the {e, ν_e, j, j'} set minimizing δm_{eν_e} will be preferred by the overall selection among
all sets reconstructing the same 4-particle mass. Note that we do not reject the selected ensemble
if the δm_{eν_e} deviation becomes too large; we leave this potential to be exploited by supplemental
cuts, which we discuss in Section 4.4.
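
The longitudinal-activity measure used in pzmh can be written in two equivalent ways, since m_⊥ exp|y| = E + |p_∥| for any four-vector. The sketch below evaluates both forms and picks the less forward of two candidate eν_e systems; it covers only this default choice, not the δm-based override described in the text, and the function names are hypothetical.

```python
import math


def longitudinal_activity(vec):
    """b = m_perp * exp(|y|) for a four-vector (E, px, py, pz)."""
    e, _, _, pz = vec
    m_perp = math.sqrt(e * e - pz * pz)
    rapidity = 0.5 * math.log((e + pz) / (e - pz))
    return m_perp * math.exp(abs(rapidity))


def longitudinal_activity_direct(vec):
    """Equivalent closed form: b = E + |p_z|."""
    return vec[0] + abs(vec[3])


def less_forward(candidates):
    """Default pick: the candidate subsystem with the smaller b."""
    return min(candidates, key=longitudinal_activity)


# Two hypothetical e-nu subsystems built from the two neutrino p_z solutions.
central = (50.0, 10.0, 20.0, 30.0)
forward = (90.0, 10.0, 20.0, -80.0)
```

The closed form avoids the logarithm and is numerically safer for very forward systems, where E - |p_z| becomes small.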

We explored in detail how each of these selection criteria compares to the ideal case. Figure 6 shows
the significances per M_h test point that we can achieve running the different combinatorial selections
for various, reasonable window parameters. In the upper panels we display them directly on top of the
(quasi optimal) ideal case, i.e. the invm combinatorial h candidate selection. The plots on the right and
in the center respectively exhibit the results of the mt and pzmh methods for the whole M_h test range.
The pzmw method yields similar, yet slightly worse results below the WW mass threshold compared to
pzmh. Therefore, we split the leftmost pane into two subplots: on the right, one finds the pzmw results
Significance vs. Higgs boson mass using more realistic selection procedures.
Figure 6: Single W+jets background significances as a function of M_h for 4 different realistic Higgs boson
candidate selections using jet combinatorics. The selection types are denoted on top of each panel. Results are shown for
4 different mass window parameter settings each, overlaying the sort of optimal case defined by the invm
combinatorial h reconstruction for Δ = 10, δ = 20 GeV, which also serves as the main reference. The eν_e+jets final states
are generated from the signal, gg → h → WW, and the W+jets background. All S/√B were calculated according
to Eqs. (23) assuming an integrated luminosity of L = 10 fb^-1 and including electron and muon channels, i.e.
a factor of 2. The lower plots hold the ratios of realistic over ideal S/√B. The thick lines are drawn with respect to the
main reference, while the thin lines are taken from comparing to the invm combinatorial selection relying on the
same window parameters.



for masses above threshold; on the left we then already reveal the outcome of the mtp selection, whose 
discussion we postpone until the next subsection. 

The bottom-row plots of Figure 6 depict the respective survival fractions: the significance ratios for
each selection and different window parameters always taken with respect to the quasi optimal case (cf.
the thick lines with symbols). To indicate the net effect of the more realistic approaches, we also show
in each case the significance loss relative to the invm selection using exactly the same mass windows
as the realistic one (cf. the thin lines with no symbols). We observe serious losses, larger than 50%, if
one uses selections with tighter window parameters and/or runs for M_h points further away from 2 M_W.
The reconstruction methods (pzmw, pzmh) do not work better than 90% of the quasi optimal case. This
Signal over major-background ratio using more realistic selection procedures.
Figure 7: S/B ratios as a function of M_h for 4 types of realistic combinatorial Higgs boson candidate selections,
each with their window parameters chosen so as to reach maximal significance. The eν_e+jets final states are generated
from the gg → h → WW signal and W+jets background. The S/B were calculated from the σ_S and σ_B as obtained
after the selection, the signal K-factors of Table 1 and K_B as given in Eq. (11).



only is improved by the mt approach making use of broad Higgs boson mass windows where one can
reach up to about 100%. However, lowering Δ here quickly results in losing sensitivity. Relative to the
quasi optimal invm selection, the various results point us to work with medium-sized Higgs boson mass
windows, always imposing the dijet mass cuts. Tighter Δ constraints may help improve the outcome of
the reconstruction types, but are a disadvantage in the measurement.

In all selections the below-threshold region is especially problematic. One might settle for 65-80%
of the ideal significances, but if we want to get a better handle on the low h boson masses, in particular
include the M_h = 130 GeV mass point, we have to push further - which we do in Section 4.3.1.

Focusing on the above-threshold region, we see that pzmw slightly outperforms pzmh over the whole
range; only for the near-threshold region up to M_h = 180 GeV is this topped by the mt selection for
medium-sized Δ windows. This is somewhat surprising, but seems plausible if one considers that signal
events are central in rapidity and manifest themselves in larger transverse activity on average. This has
to be contrasted with the W+2-jets background whose events tend to populate phase space more along the
beam direction, which generates y_{eν_e jj'} distributions peaking about half a unit away from zero rapidity.
Nevertheless the differences between the 3 methods are not conclusive per se; to some extent the selections
will shape distributions differently and it is easy to imagine the picture changing if additional cuts are
imposed. But one has to bear in mind that there is a second, very important criterion, the ratio S/B, which one
wants to maximize. For their optimal window parameters, we show in Figure 7 the S/B curves of the more
realistic and ideal selections as functions of M_h. Even more surprisingly than before, the mt outperforms
the neutrino reconstruction methods and, moreover, the ideal S/B are also beaten unless M_h < 2 M_W.
Based on this observation one may prefer the methods where the selection utilizes transverse masses, with
the only drawback of having no p_{∥,ν_e} estimate available.

4.3.1 More realistic reconstruction below the on-shell diboson mass threshold 

We have just seen that the significances achievable in more realistic scenarios drop off considerably below 
the WW mass threshold, amplifying the loss already present in the ideal case. Therefore, it is of great 
importance to learn how the reconstruction methods described above can be applied more efficiently in 
the below-threshold region. 

We noticed that the mt selection picks up background events, which often fall outside (mostly above)
the Higgs boson mass window. As a result, a somewhat different class of background events survives
the mt selection procedure compared to utilizing the invm, ideal, approach. This is no surprise since
we have already argued that the Higgs boson decays yield an enhanced transverse production with regard
to the W+2-jet background. Using M_h = 130 GeV, Figure 8 exemplifies this by means of the m_{eν_e jj'}
and m_{T,eν_e jj'}/m_{eν_e jj'} ratio distributions. We clearly see the large impact on the W+jets results being a
consequence of enforcing a transverse- rather than invariant-mass window. Imposing the constraint
80 < m_{T,eν_e jj'}/GeV < 130 on the background is fairly equivalent to choosing events with larger longitudinal
components. This drives the associated invariant masses to higher values whereas the m_T/m ratios are
shifted to lower ones.

To give a more quantitative example, we consider the case M_h = 130 GeV for broad (tight) Higgs
boson mass windows and δ = 20 GeV. If we select events using mt, then discard all events with
an invariant mass m_{eν_e jj'} outside M_h ± 25 (8) GeV, we find a quite impressive gain of 82% (360%).
Similarly, if we use the invm selection and apply further cuts removing events with m_{T,eν_e jj'} values
greater than M_h or less than M_h − 2Δ = M_h − 50 (16) GeV, we observe that S/√B improves by
12% (drops by 20%). As the invm selection has a better starting S/√B than mt, the final significances
are similar in all cases. The essential point, however, is that we can improve the significance by combining the
transverse- and invariant-mass selections.
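
The combination of the two windows can be sketched as a simple per-candidate filter. This is only an illustration: the function name and the example candidates are hypothetical, and the inputs are assumed to be the already computed invariant and transverse masses of a selected 4-particle candidate. The transverse window is taken as M_h − 2Δ ≤ m_T ≤ M_h, as in the example above.

```python
def passes_combined_window(m_inv, m_t, m_h, delta):
    """Return True if a Higgs candidate lies in both mass windows (all values in GeV).

    m_inv : invariant mass of the selected 4-particle system
    m_t   : transverse mass of the same system
    m_h   : Higgs boson test mass
    delta : half-width of the invariant-mass window
    """
    in_invariant_window = abs(m_inv - m_h) <= delta
    in_transverse_window = (m_h - 2.0 * delta) <= m_t <= m_h
    return in_invariant_window and in_transverse_window


# Hypothetical candidates (m_inv, m_t) for M_h = 130 GeV and the broad window.
candidates = [(128.0, 110.0), (170.0, 120.0), (131.0, 70.0)]
kept = [c for c in candidates if passes_combined_window(c[0], c[1], 130.0, 25.0)]
```

Only the first candidate survives here: the second fails the invariant-mass window and the third falls below the transverse-mass window.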

To exploit this potential in a more realistic scenario, we use (in a first phase) the mt selection to pick
out the 4-particle system, then (in a second phase) reconstruct the neutrino momentum following the
pzmw procedure. Compared to the ideal case, this reconstruction works rather inefficiently in identifying
background events that yield invariant masses m_{eν_e jj'} > M_h + Δ. As it is optimized to the features of
the Higgs boson decay signal, pzmw generates m_{eν_e} estimates by assuming m_{T,eν_e}/m_{eν_e} ratios close
to 1. For W+jets, these choices often turn out to be significantly smaller than the actual masses of the
leptonically decaying W boson. Since the W+jets background usually contains an on-shell W, a lower m_{eν_e}
is not ideal and tends to bring down the reconstructed, associated 4-particle masses m_{eν_e jj'}. As a result,
a good fraction of background events possessing true invariant masses exceeding the upper bound on
m_{eν_e jj'} (see left plot of Figure 8) are shuffled back into the Higgs boson mass window.

Figure 8: Differential 4-particle mass spectra m_{eν_e jj'} (see left panel) and m_{T,eν_e jj'}/m_{eν_e jj'} ratio distributions
after combinatorial selection of Higgs boson candidates using a 50 GeV symmetric mass window centered at a mass
of M_h = 130 GeV. The characteristics of the mt selection are compared to the ideal case of choosing candidates according
to the invm criteria. Eq. (8) was used to compute the mt criteria. All results are shown for eν_e+jets final states
originating from the gg → h → WW signal (solid lines) and the W+jets background (dashed lines).

Hence, we make adjustments to the pzmw reconstruction used in this symbiosis of selections so that
it performs better below threshold. The basic idea is to maximally exploit the differences between signal and
background in the leptonic and hadronic W mass distributions. If, after the initial mt selection, we constrain
the transverse masses of the eν_e and dijet subsystems by placing cuts that favor m_{T,eν_e} ≲ M_W/2 and
m_{T,jj'} ~ M_W, we enforce off-shell W → eν_e decays while keeping those of W → jj' on-shell. This
is beneficial to the signal and, at the same time, suppresses the W+jets background. Here, we only add
requirements on the hadronic subsystem by imposing a minimum value for m_{T,jj'}. We leave the m_{T,eν_e}
potential to be exploited by the M_h-dependent cut optimization (which we discuss in Section 4.4), since
demanding an upper bound on m_{T,eν_e} would enhance not only mt but all realistic selections.

With this major adjustment, we can now design a promising mt+pzmw selection, which we call mtp: 

• mtp: the very first part, computing m_{T,eν_e jj'}, is identical to the mt selection. Subsequently, we
require m_{T,jj'} to be greater than m_{T,jj'}^{min} = 3.51 M_h − 0.015 M_h² − 135.4 GeV, which we have
parametrized in terms of M_h for convenience. This concludes the phase of testing the transverse
criteria. Accepted {e, ν_e, j, j'} candidates are subject to the neutrino reconstruction, whose implementation
deviates from pzmw to some extent: we choose m_{eν_e} = 0.55 m_{T,eν_e} + 0.45 Q, freezing it
below m_{T,eν_e}; here we employ Q = M_h − m_{jj'} but keep it constant above M_W + δ_T (δ_T = 4 GeV).
As in all other reconstruction methods, we then solve for the longitudinal momentum of the neutrino,
cf. Eq. (25), and reject the particular {e, ν_e, j, j'} choice if the solutions are degenerate or generate a
mean m_{eν_e jj'} mass deviating from M_h by more than Δ' = max{Δ, 20 GeV}.† In all other cases,
we pick the solution giving the smaller |m_{eν_e jj'} − M_h| and accept it if the reconstructed h mass
falls inside the Δ' window around M_h.

† Here, we do not adopt tight Δ choices owing to the uncertainties intrinsic to the reconstruction of the p_{∥,ν_e} component.
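
Two steps of the mtp procedure lend themselves to a short sketch: the M_h-dependent lower bound on the dijet transverse mass and the final choice between the two neutrino solutions. The function names are hypothetical, the threshold is read as 3.51 M_h − 0.015 M_h² − 135.4 with all masses in GeV, and "degenerate" is interpreted, as an assumption, as the two solutions giving identical masses.

```python
def mt_jj_min(m_h):
    """Minimum dijet transverse mass (GeV) for a Higgs test mass m_h (GeV)."""
    return 3.51 * m_h - 0.015 * m_h**2 - 135.4


def choose_solution(mass_pair, m_h, delta):
    """Pick one of the two reconstructed 4-particle masses, or None to reject."""
    m1, m2 = mass_pair
    window = max(delta, 20.0)                 # Delta' = max{Delta, 20 GeV}
    if m1 == m2:                              # assumption: degenerate = identical solutions
        return None
    if abs(0.5 * (m1 + m2) - m_h) > window:   # mean mass too far from M_h
        return None
    best = min((m1, m2), key=lambda m: abs(m - m_h))
    return best if abs(best - m_h) <= window else None


# Thresholds for the below-threshold test masses.
thresholds = {m_h: round(mt_jj_min(m_h), 1) for m_h in (120.0, 130.0, 140.0, 150.0)}
```

With this reading the bound decreases from roughly 70 GeV at M_h = 120 GeV to roughly 54 GeV at M_h = 150 GeV.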

The mtp results we can reach in terms of significance and sensitivity are already known from Figures 6
and 7. The increase in S/√B (leftmost plot in Figure 6) and S/B (Figure 7) is highly visible for all
of the h test masses below threshold. For the mtp analyses with the broadest windows, the significance
effect is huge compared to the respective ideal selections using the same Δ parameter. Likewise, the signal
over background ratio turns out to be similar to or even better than in the quasi-optimal case.

Not only does mtp profit from the better selection performance, but we also end up with a fully reconstructed
neutrino vector, giving us access to a larger set of observables. However, because of the hard
cut on m_{T,jj'}, we notice that the mtp-selected backgrounds are strongly sculpted, washing out a number
of shape differences. Moreover, the effect of the m_{T,jj'} cut deteriorates as soon as M_h > 2 M_W. There, we
obtain significances similar to those of the other methods and, since they sculpt the background less, there is no
real advantage to applying mtp above threshold; hence, we restrict its use to the low-mass h region. On
that note, it remains to be studied whether the S/√B and S/B improvements came at the expense of further
handles for the M_h-dependent optimization. As a possible consequence, the mtp selection might actually
be superseded by a selection that is inferior at this level, once it is enhanced by appropriately designed cuts.



4.4 Optimized selection - analyses refinements and (further) significance improvements

Having established the more realistic overall picture, we here discuss steps to achieve better signal over
background discrimination. After baseline and combinatorial selections, we are interested in cuts that
help further increase the significances obtained so far. Our aim is to identify observables
for each M_h test point that are sufficiently uncorrelated such that simultaneous selections yield a total
significance gain in the range:

    \max\left\{ \frac{S_1}{\sqrt{B_1}},\ \frac{S_2}{\sqrt{B_2}} \right\} < \frac{S_{1+2}}{\sqrt{B_{1+2}}} < \ldots    (26)

The above relation is written out for the example of two extra handles, but is easily extended to multi-cut scenarios.
As we have shown, the subdominant backgrounds have negligible effects at this level; we therefore
concentrate on reducing the W+jets background, although the other backgrounds are still included in computing
the total significance. In order to be conservative about mass resolutions for hadronic final states at
the Tevatron, we fix the mass window parameters to Δ = δ = 20 GeV.
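
To make the notion of a significance gain concrete: a cut with signal efficiency ε_S and background efficiency ε_B rescales S/√B by ε_S/√ε_B. If two cuts are uncorrelated in both signal and background, the efficiencies multiply and so do the gains. The sketch below uses made-up efficiencies purely for illustration; they are not results of this study.

```python
import math


def gain(eff_s, eff_b):
    """Factor by which S/sqrt(B) changes for given signal/background efficiencies."""
    return eff_s / math.sqrt(eff_b)


# Illustrative efficiencies for two hypothetical, mutually uncorrelated cuts.
g1 = gain(0.90, 0.50)
g2 = gain(0.80, 0.40)

# Applying both cuts: efficiencies multiply when the cuts are uncorrelated.
g12 = gain(0.90 * 0.80, 0.50 * 0.40)
```

In this idealized case the combined gain equals the product of the single-cut gains; correlations between the observables reduce it.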

As we have done in earlier sections, we divide the optimized selection into three broad Higgs boson 
mass ranges: below threshold, near threshold, and above threshold. As we vary the Higgs boson mass, 
we probe different kinematic configurations for the background. For the lowest Higgs boson mass, many 
background distributions (H_T, p_T, etc.) are steeply rising in the region of interest, cut off from below by
baseline kinematics. For intermediate Higgs boson masses, the backgrounds tend to be flatter or peaked, 
while the higher Higgs boson mass region overlaps with backgrounds that are sharply falling. This basic 
shape behind (many) background distributions drives which cuts are optimal for a given Higgs boson 
mass. We present a number of distributions in Appendix B.2 to back up the many findings presented in
this subsection. 

The optimized analysis employs only observables constructed out of the momenta of the selected 4-particle
system {e, ν_e, j, j'}. More inclusive observables may be useful: for example, the scalar sum of
the two selected-jet p_T versus that of the two hardest-jet p_T, H_{T,jj'} versus H_{T,12}. However, such observables may
also be subject to larger uncertainties from, e.g., the modeling of hard initial-state radiation. To reduce
these uncertainties one could extend the ME+PS program here, e.g. to inclusive W+3-jets; however, this is
beyond the scope of this study.

In the optimized analysis the longitudinal observables, Δη_{jj'} or η_{eν_e jj'}, offer only moderate gains in
significance (typically on the order of 3-10%, with larger gains occurring for heavy M_h),
but they are insensitive to the exact value of the Higgs boson mass and are therefore more broadly applicable.
The total 4-particle pseudo-rapidity η_{eν_e jj'} can only be used in the pzmw, pzmh or mtp selections,
since reconstruction of the neutrino is necessary. By the same logic, because the pseudo-rapidity difference
between the selected jets is independent of the neutrino, cuts on Δη_{jj'} can be used with all selections
(though they are not efficient for mtp). We find that the requirements |Δη_{jj'}| < 1.5 and |η_{eν_e jj'}| < 3.0
work well for the entire range of Higgs boson masses we are interested in, so we include these cuts in
our optimized (pzmh/w and mt) selections. In Appendix B.2 we present examples of |Δη_{jj'}| distributions
(Figure 12) and η_{eν_e jj'} spectra (Figure 13) after un-optimized combinatorial selections, documenting the
usefulness of these cuts.
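
The two longitudinal requirements can be sketched as follows. The function names and example momenta are hypothetical; momenta are given as (px, py, pz) in GeV, and the neutrino entry is assumed to be the fully reconstructed one, which is why the 4-particle pseudo-rapidity is only available for selections that reconstruct the neutrino.

```python
import math


def pseudorapidity(p):
    """Pseudo-rapidity of a 3-momentum p = (px, py, pz); requires non-zero pT."""
    px, py, pz = p
    return math.asinh(pz / math.hypot(px, py))


def passes_longitudinal_cuts(lepton, neutrino, jet1, jet2, deta_max=1.5, eta_max=3.0):
    """Apply |delta eta(j, j')| < deta_max and |eta(4-particle system)| < eta_max."""
    delta_eta_jj = abs(pseudorapidity(jet1) - pseudorapidity(jet2))
    total = tuple(sum(c) for c in zip(lepton, neutrino, jet1, jet2))
    return delta_eta_jj < deta_max and abs(pseudorapidity(total)) < eta_max


# Hypothetical central candidate and a variant with one very forward jet.
central = passes_longitudinal_cuts((20.0, 25.0, 5.0), (-10.0, 15.0, 30.0),
                                   (40.0, 0.0, 10.0), (-30.0, 10.0, -20.0))
forward = passes_longitudinal_cuts((20.0, 25.0, 5.0), (-10.0, 15.0, 30.0),
                                   (40.0, 0.0, 10.0), (-30.0, 10.0, -120.0))
```

The first candidate passes both requirements, whereas the second is rejected because the two jets are separated by more than 1.5 units of pseudo-rapidity.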

The other useful observables we have found are all transverse, or in some cases based on invariant
masses. Unlike the longitudinal variables, the optimal transverse variables and cut values depend strongly
on the Higgs boson mass. We also find that transverse and longitudinal observables are largely uncorrelated
in this study, so any gains in significance from selections on the transverse observables add to the
gains from |Δη_{jj'}| and |η_{eν_e jj'}|. Figures 14 and 15 of Appendix B.2 illustrate
this decorrelation for the case of |Δη_{jj'}| versus m_{eν_e jj'}.



Below-threshold region: For pzmw and pzmh, the reconstruction selections, the transverse mass of
the 4-particle system, m_{T,eν_e jj'}, and (for pzmw at low M_h) the transverse mass of the leptonic system,
m_{T,eν_e}, are the best observables. This result is really just a reiteration of the fact that simple reconstruction
selections work poorly for below-threshold Higgs bosons. In our effort to make pzmh/w more flexible and
apply them to below-threshold scenarios, we have allowed for background configurations
that are inconsistent with a single parent resonance, such as 4-particle systems with m_{T,eν_e jj'} > M_h.
Removing this inconsistent region results in gains of O(40%).

For mt, the transverse mass of the dijet system, m_{T,jj'}, or the scalar p_T sum formed with the selected jets,
H_{T,jj'}, are the best observables, with improvements of O(50%). The former cut takes advantage of the fact
that the signal jets originate from a (real or virtual) W boson, while the background jets come primarily from
ISR - information that is not exploited in the initial mt selection. Somewhat smaller gains come from
cutting on the total selected system's transverse momentum, p_{T,eν_e jj'}.

For mtp, there is little optimization to be done. Much of the physics that is behind the optimal cuts in 
the pzmh/w or mt cases has already been incorporated into the selection process. Some improvement is 
possible by cutting out the region with very low 4-particle transverse momentum, p_{T,eν_e jj'}.

We illustrate our findings by showing and commenting on a small collection of spectra resulting from 
the baseline combinatorial selections; for more details, we refer the reader to the discussion around Figure 16
of Appendix B.2.






M_h    comb. h-reco  leading = major cut         gain  subleading cut              gain  minor cut                   gain
[GeV]  selection     [range in GeV]              [%]   [range in GeV]              [%]   [range in GeV]              [%]

120    mtp           m_{jj'} [75, ∞]             17    H_{T,jj'} [76, ∞]           9     p_{T,j} [38, ∞]             4
       mt            m_{T,jj'} [72, ∞]           70    H_{T,jj'} [72, ∞]           52    H_{T,eν_e jj'} [108, ∞]     6
       pzmw          m_{T,eν_e} [0, 40]          73    m_{T,eν_e jj'} [0, 120]     38    H_{T,jj'} [64, ∞]           10
       pzmh          m_{T,eν_e jj'} [0, 120]     63    m_{eν_e jj'} [0, 124]       59    m_{⊥,jj'} [76, ∞]           48

130    mtp           p_{T,eν_e jj'} [9, ∞]       6     m_{T,eν_e} [0, 48]          4     m_{⊥,jj'} [76, ∞]           4
       mt            H_{T,jj'} [72, ∞]           38    m_{T,jj'} [68, ∞]           31    m_{jj'} [73, ∞]             4
       pzmw          m_{T,eν_e} [0, 48]          53    m_{T,eν_e jj'} [0, 130]     42    m_{⊥,jj'} [72, ∞]           11
       pzmh          m_{T,eν_e jj'} [0, 130]     48    m_{eν_e jj'} [0, 134]       46    H_{T,jj'} [68, ∞]           25

140    mtp           p_{T,eν_e jj'} [12, ∞]      8     H_{T,jj'} [68, ∞]           3     m_{T,eν_e} [0, 60]          3
       mt            H_{T,jj'} [68, ∞]           24    p_{T,eν_e jj'} [16, ∞]      15    p_{T,eν_e jj'} [15, ∞]      6
       pzmw          m_{T,eν_e} [0, 56]          30    m_{T,eν_e jj'} [0, 140]     30    m_{⊥,jj'} [70, ∞]           6
       pzmh          m_{T,eν_e jj'} [0, 140]     29    m_{eν_e jj'} [0, 144]       28    H_{T,jj'} [68, ∞]           14

150    mtp           p_{T,eν_e jj'} [17, ∞]      14    H_{T,eν_e jj'} [116, ∞]     3     H_{T,eν_e jj'} [116, ∞]     *
       mt            p_{T,eν_e jj'} [20, ∞]      18    H_{T,jj'} [60, ∞]           10    H_{T,jj'} [56, ∞]           *
       pzmw          m_{T,eν_e jj'} [0, 150]     20    p_{T,eν_e jj'} [18, ∞]      18    p_{T,eν_e jj'} [18, ∞]      9
       pzmh          m_{T,eν_e jj'} [0, 150]     19    m_{eν_e jj'} [0, 154]       19    p_{T,eν_e jj'} [18, ∞]      9

165    mt            p_{T,eν_e jj'} [18, ∞]      18    H_{T,eν_e jj'} [136, ∞]     12    Δφ_{e,ν_e} > 1.9            3
       pzmw          p_{T,eν_e jj'} [18, ∞]      18    m_{eν_e jj'} [0, 170]       17    γ_{jj'|eν_e} < 1.06         20
       pzmh          p_{T,eν_e jj'} [18, ∞]      18    m_{eν_e jj'} [0, 170]       13    γ_{jj'|eν_e} < 1.09         15

170    mt            p_{T,eν_e jj'} [21, ∞]      20    H_{T,eν_e jj'} [140, ∞]     16    Δφ_{e,ν_e} > 1.7            *
       pzmw          p_{T,eν_e jj'} [19, ∞]      20    m_{eν_e jj'} [0, 176]       13    γ_{jj'|eν_e} < 1.12         11
       pzmh          p_{T,eν_e jj'} [20, ∞]      20    m_{eν_e jj'} [0, 176]       9     γ_{jj'|eν_e} < 1.16         9

180    mt            p_{T,eν_e jj'} [22, ∞]      24    H_{T,eν_e jj'} [148, ∞]     22    H_{T,eν_e jj'} [140, ∞]     *
       pzmw          p_{T,eν_e jj'} [21, ∞]      23    H_{T,jj'} [64, ∞]           11    1.06 < γ_{jj'|eν_e} < 1.22  5
       pzmh          p_{T,eν_e jj'} [22, ∞]      24    H_{T,eν_e jj'} [140, ∞]     10    m_{T,eν_e jj'} [136, 182]   5

190    mt            p_{T,eν_e jj'} [24, ∞]      28    H_{T,eν_e jj'} [156, ∞]     27    H_{T,eν_e jj'} [148, ∞]     *
       pzmw          p_{T,eν_e jj'} [23, ∞]      24    H_{T,eν_e jj'} [148, ∞]     15    1.12 < γ_{jj'|eν_e} < 1.30  5
       pzmh          p_{T,eν_e jj'} [24, ∞]      29    H_{T,eν_e jj'} [144, ∞]     17    m_{T,eν_e jj'} [142, 194]   3

200    mt            H_{T,eν_e jj'} [164, ∞]     31    p_{T,eν_e jj'} [24, ∞]      28    p_{T,eν_e jj'} [15, ∞]      9
       pzmw          p_{T,eν_e jj'} [24, ∞]      25    H_{T,eν_e jj'} [156, ∞]     20    1.18 < γ_{jj'|eν_e} < 1.40  6
       pzmh          p_{T,eν_e jj'} [27, ∞]      32    H_{T,eν_e jj'} [156, ∞]     25    H_{T,eν_e jj'} [144, ∞]     4

210    mt            H_{T,eν_e jj'} [172, ∞]     36    p_{T,eν_e jj'} [25, ∞]      27    p_{T,eν_e jj'} [15, ∞]      8
       pzmw          H_{T,eν_e jj'} [160, ∞]     24    p_{T,eν_e jj'} [24, ∞]      23    1.25 < γ_{jj'|eν_e} < 1.45  14
       pzmh          H_{T,eν_e jj'} [162, ∞]     36    p_{T,eν_e jj'} [30, ∞]      36    1.25 < γ_{jj'|eν_e} < 1.54  7

220    mt            H_{T,eν_e jj'} [180, ∞]     39    m_{T,eν_e jj'} [174, ∞]     26    p_{T,eν_e jj'} [12, ∞]      8
       pzmw          H_{T,eν_e jj'} [168, ∞]     29    p_{T,eν_e jj'} [24, ∞]      22    1.31 < γ_{jj'|eν_e} < 1.53  15
       pzmh          H_{T,eν_e jj'} [172, ∞]     49    p_{T,eν_e} [56, ∞]          43    1.30 < γ_{jj'|eν_e} < 1.58  8

Table 6: Leading (optimal/major) and subleading cuts for each Higgs boson mass and selection criterion (all selections
are combinatorial with window parameters Δ = δ = 20 GeV). The gain in S/√B, in percent, is shown after each
cut. Having used the major discriminators including pseudo-rapidity cuts (see text), next in the cut hierarchy are
the minor cuts shown in the two rightmost columns. The significance gains associated with them are understood to be in
addition to the major cut improvements. Cuts marked with an asterisk yield less than 2% improvement.

Near-threshold region: For Higgs boson masses close to the WW threshold, the 4-particle p_T is the
most powerful additional handle. In Higgs boson production, as in other colour-singlet resonance production,
the p_{T,eν_e jj'} distribution is cut off at low values by soft-gluon resummation, and falls off at high
values because of parton kinematics. The result is a peaked distribution. The hard scale of the process,
dictated by the Higgs boson mass, sets the initial ISR scale, thereby influencing where p_{T,eν_e jj'} peaks.
Since the Higgs boson is heavier than the W boson, p_{T,eν_e jj'} always peaks at higher values for the signal
compared to W production. Even though the dominant background for our study is W+2-jets, rather
than W+0-jets, the argument still holds: the peak in p_{T,eν_e jj'} for W+2-jets is still governed by M_W and
remains at lower values than for the Higgs boson signal. Selection criteria change the tails of the
background p_{T,eν_e jj'} distribution, but do not affect the location of the peak. By cutting out the low-p_{T,eν_e jj'}
region, improvements on the order of 20% are possible. Distributions such as H_{T,jj'}, the scalar p_T sum
of the two selected jets, or H_{T,eν_e jj'}, the H_T of the 4-particle system, also have potential discriminating
power. Signal versus background distributions in the relevant variables after baseline combinatorial
selections are shown in Figure 17; see Appendix B.2.

Above-threshold region: For higher Higgs boson masses, the total amount of (transverse) energy
in the W+jj' system becomes the most powerful discriminator between the signal and the background.
Specifically, once M_h > 200 GeV, the H_T of the selected 4-particle system peaks at significantly higher
values than the background, regardless of the selection technique. By cutting away the low-H_T region,
we find that gains of order 25% are possible. The 4-object p_T remains a very useful observable, as does the
4-particle transverse mass. Examples of signal versus background distributions are shown in Figure 18;
see Appendix B.2.

Results: The optimal ("leading" or "major") cuts for the different Higgs boson mass categories and
selection methods are summarized in Table 6. To give some idea of how useful the single best discriminator
is compared to other observables, we also show the percent increase in significance for the second-best
("subleading") single discriminator. Separately, we have also determined which combinations of the
leading discriminator (supplemented by the respective pseudo-rapidity cuts discussed above) with a second
observable give the largest (additional) increase in significance. The second ("minor") cuts of these optimal
two-variable selections are summarized in the last two columns of Table 6. Note that in most cases these
minor cuts do not involve the same observables as the subleading cut; this is because the subleading
discriminator is typically strongly correlated with the leading discriminator, and thus does not add much
to the combined significance. Some ideas beyond the application of minor cuts exist; we comment in
Appendix B.3 on a few possible routes one can take to enhance the optimized analyses presented here.

When looking for minor cuts, we found, in addition to variables we have already discussed, such as
γ_{jj'|eν_e} (introduced earlier), H_{T,eν_e jj'} and the 4-object p_T, a few other observables, namely p_{T,j}, Δφ_{e,ν_e} and
m_{⊥,jj'}, to be beneficial. The first two, p_{T,j} and Δφ_{e,ν_e}, are common, so we do not repeat their definitions
here. The last minor-cut observable, m_{⊥,jj'}, is defined through m_{⊥,jj'}² = m_{jj'}² + p_{T,jj'}², which is yet
another way of defining a transverse mass. The additional gains from the minor cuts are typically small,
except close to the WW threshold and for the largest Higgs boson masses considered here. In particular, for
M_h > 2 M_W, the boost of the jj' system in the reconstructed 4-object rest frame stands out as a helpful
discriminator of secondary order; for more details, we refer again to Appendix B.2 and the discussion
around Figure 19. Also, the dijet-system based handles, m_{⊥,jj'} and H_{T,jj'}, yield fairly substantial extra
gains, but only at low M_h if we rely on the pzmh method. Once jets are picked that stem from the
backgrounds, the strict M_h mass reconstruction of the selected 4-object, as encoded in pzmh and amplified
by the major cut on m_{T,eν_e jj'}, gives rise to the selection of less energetic j, j' jets with preferred
pair masses of m_{jj'} ~ M_W − δ. The observables m_{⊥,jj'} and H_{T,jj'} exploit this fact conveniently and hence
facilitate such secondary improvements, as shown in Figure 19 for the example of M_h = 130 GeV.
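
For orientation, the dijet observables used as minor cuts can be computed from the two selected jet four-momenta as in the following sketch. The function name and the example jets are hypothetical; four-momenta are given as (E, px, py, pz) in GeV, and m_⊥ follows the relation m_⊥² = m² + p_T² quoted above.

```python
import math


def dijet_observables(jet1, jet2):
    """Return m_jj', m_perp,jj' and H_T,jj' for two jets given as (E, px, py, pz)."""
    e, px, py, pz = (a + b for a, b in zip(jet1, jet2))
    m_sq = max(e * e - px * px - py * py - pz * pz, 0.0)   # guard against rounding
    pt_sq = px * px + py * py
    return {
        "m_jj": math.sqrt(m_sq),
        "m_perp_jj": math.sqrt(m_sq + pt_sq),              # m_perp^2 = m^2 + pT^2
        "ht_jj": math.hypot(jet1[1], jet1[2]) + math.hypot(jet2[1], jet2[2]),
    }


# Two hypothetical massless jets.
obs = dijet_observables((50.0, 30.0, 40.0, 0.0), (50.0, -30.0, 0.0, 40.0))
```

Note that H_{T,jj'} is a scalar sum over the individual jets, whereas m_{⊥,jj'} is built from the vector sum, so the two respond differently to the opening angle between the jets.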

Finally, the baseline combinatorial and optimized combinatorial significances are displayed as a function
of the Higgs boson mass in Figure 9. The ratio plots associated with each selection in the lower part of
the figure visualize the significance increase achieved by the optimization. Independent of the selection,
they also indicate an O(10%) drop in significance caused by the subdominant backgrounds. Focusing on
the S/√B ratios taken with respect to the ideal case (orange lines with circles), these ratio plots emphasize
that the optimized mt(p) (transverse) and pzmw selections work best below and above the WW
mass threshold, respectively. The related S/B ratios presented in Figure 10 confirm these findings. They
turn out to be rather small, as a consequence of maximizing the significance and trying to preserve most
of the signal, neither of which allows for imposing overly restrictive cuts. Advantageously, the actual
number of signal events, S, present in this h → WW channel is not small. Except for the Higgs signal
at M_h = 120 GeV (with O(4) expected events), the optimized analyses usually leave us with
50-300 signal events, if we assume an integrated luminosity of L = 10 fb^-1. Even the ~25 signal
events for M_h = 130 GeV are sufficient, particularly as S/B increases to ~0.04.
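
As a rough consistency check, and taking the approximate values quoted for M_h = 130 GeV at face value (S ≈ 25 and S/B ≈ 0.04), the implied background yield and significance are:

```latex
B \approx \frac{S}{S/B} \approx \frac{25}{0.04} \approx 625 ,
\qquad
\frac{S}{\sqrt{B}} \approx \frac{25}{\sqrt{625}} = 1.0 .
```

This back-of-the-envelope estimate ignores how the individual backgrounds and lepton channels are combined, so it should be read only as an order-of-magnitude illustration.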

Clearly, as seen from Figure 9, the optimized significances for the four different Higgs boson reconstruction
methods are very similar. The best significance is for Higgs boson masses close to the WW
threshold, suggesting that it would be possible to achieve 95% confidence level exclusion in a stand-alone 
analysis. For a Higgs boson mass in the range 130 < Mh < 150 GeV, the optimized significance is 
between 0.7 and 1.4. Given the additional improvements expected from a full multivariate analysis, this 
indicates that the semileptonic Higgs boson decay channel can make a significant contribution to Higgs 
boson exclusion in this mass range. 

5 Conclusions, caveats, and prospects 

We have presented a systematic study of the prospects for extending the Tevatron exclusion reach for a 
Standard Model Higgs boson by including the final states arising from semileptonic Higgs boson decays. 
We have used a realistic simulation of the Higgs signal and the relevant Standard Model background 
processes to exhibit the kinematic differences between the signal and background. We have used three 
qualitatively different approaches to extracting the event kinematics, one based on transverse observables 
and the others based on approximate event-by-event full reconstruction. We have shown that all three
approaches give similar results when one optimizes selections based on several discriminating observables. 
The details of the optimization depend on the Higgs boson mass, and in particular on whether it is below, 
near, or above the threshold for decay to two on-shell W bosons. 

The optimized significances that we have achieved are not sufficient for stand-alone Higgs boson exclusion
except in the most favorable case, where the Higgs boson mass is close to threshold. However, the
sensitivities shown here are certainly promising as ingredients for a combined multi-channel analysis.

One important caveat is that the signal to background ratios for this type of analysis are fairly small,
on the order of a percent, as illustrated in Figure 10. This means that Higgs boson exclusion is sensitive to
relatively small systematic errors in the modeling of the backgrounds, notably the dominant background
from W+jets. However, we have shown here that an experimental analysis has multiple over-constrained
handles on the kinematic features of the data, providing extra cross-checks. In addition, the Higgs boson
candidate selection employed here to identify the two jets from the Higgs boson decay is by design rather
stable against effects from extra hard radiation; this reduces the uncertainty in the background modeling.

[Figure 9 plot: Significance vs. Higgs boson mass after cut optimization for realistic selections. Panels: pzmw, pzmh, mtp & mt. Horizontal axis: Higgs boson mass M_h.]

Figure 9: Total significances as a function of M_h after including the subdominant backgrounds and all major plus
minor cuts (as specified in the text and Table 6). The results (lines with squares) are shown for the four types of
more realistic Higgs boson candidate selections presented in this work; they are denoted at the
top of each panel. In all cases the combinatorial approach using window parameters Δ = δ = 20 GeV has been
applied for selecting the candidate set of particles. As before in Figure 6, the ideal case (i.e. the invm combinatorial
reconstruction of the Higgs boson candidates using 2Δ = δ = 20 GeV) is taken as the main reference to compare
the different results. For each combinatorial selection, the outcome (lines with circles) with no cuts applied (but
using a slightly smaller mass window, Δ = 15 GeV) is also displayed to emphasize the effect of the cut optimization.
Note that the effect of the minor backgrounds has been neglected in computing each of these reference curves. The
eν_e+jets final states are generated from the signal, gg → h → WW, and the W+jets, electroweak and tt̄ backgrounds.
All S/√B_i were calculated using Eqs. (23) and combined according to Eq. (24), assuming an integrated luminosity
of L = 10 fb^-1 and including electron and muon channels, i.e. f_ℓ = 2. The lower plots depict for each selection the
ratios of optimized over un-optimized S/√B (lines with squares) and optimized over ideal-case reference S/√B
(lines with circles). The thin blue lines visualize how far the final S/√B_(tot) results suffer from the presence of
the minor backgrounds.

[Figure 10 plot: Signal over background ratio after cut optimization for realistic selections (including minor-background and minor-cut effects).]

Figure 10: The S/B ratios associated with the total significances presented in Figure 9. The ratios are shown
as a function of M_h for the final selections: the combinatorial pzmw (triangles), pzmh (circles), mt (squares) and
mtp (diamonds) selections using the window parameters Δ = δ = 20 GeV, each supplemented by their specific
optimization cuts, see Table 6. The reference curve (dashed line with diamonds) is the same as used in Figure 7,
representing the invm combinatorial reconstruction with 2Δ = δ = 20 GeV, no extra cuts applied and neglecting
the impact of the minor backgrounds. The eν_e+jets final states were generated from gg → h → WW signal events,
and the W+jets, electroweak and tt̄ backgrounds. The S/B_i were calculated using the σ_S and σ_{B_i} as obtained after the
final selections, the signal K-factors of Table 1 and the K_B as given in Eqs. (11), (21) and (22). Single ratios were
combined according to S/B_(tot) = S (Σ_i B_i)^-1.

The techniques described here are applicable to Higgs boson searches at the CERN Large Hadron
Collider. At 7 TeV center-of-mass collision energy, the Higgs boson production cross section increases
by a factor of ~30, while the W+jets background increases by a factor of ~20. With no hard upper
limit on the amount of data available, one can use more restrictive selections and improve both the signal
to background ratio and the overall significance. A recent analysis by the ATLAS collaboration [91]
employed a 4-object invariant mass reconstruction and a dijet mass window similar to the baseline naive
analysis used here, but applied to the heavy Higgs boson mass region M_h > 240 GeV. A CMS study
[92] looked at the enhanced signal to background ratio provided by focusing on VBF production (and thus
requiring two extra forward jets). Because of the large branching fraction, the h → WW semileptonic
mode might prove to be observable at the LHC over a larger mass range than the h → WW dilepton
mode; thus it could play a critical role in establishing the nature of electroweak symmetry breaking, in the
event that the Higgs boson mass is ~120 GeV.
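
To illustrate what the quoted cross-section factors imply, assume (purely for the sake of the estimate) identical selection efficiencies and the same integrated luminosity as at the Tevatron, and consider only the W+jets background. The ratios then scale as:

```latex
\frac{S}{B} \;\to\; \frac{30\,S}{20\,B} = 1.5\,\frac{S}{B} ,
\qquad
\frac{S}{\sqrt{B}} \;\to\; \frac{30\,S}{\sqrt{20\,B}} \approx 6.7\,\frac{S}{\sqrt{B}} .
```

Differences in acceptance, pile-up and the composition of the subdominant backgrounds are not captured by this simple scaling.
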

A Appendix: Monte Carlo event generation

A.1 Leading-order cross sections

Here we briefly report on the tests we have done to convince ourselves of the correctness of the Sherpa 
leading-order cross section calculations. We found satisfactory or better agreement in all our cross-checks, 
which we briefly summarize here: 

• For the case of h → eν_e pp decays, we have compared Sherpa's branching ratios with those
obtained by multiplying the Hdecay results for B_{W*W*} given in Table 1 by the PDG literature
numbers for B(W → eν_e) × B(W → pp) = 0.1075 × 0.676. Using Amegic++'s mode of
calculating partial widths of decay processes, we determined B_{eν_e pp} = Γ(h → eν_e pp)/Γ_h for
the various Higgs boson masses and respective widths of Table 1. The differences seen are at most
at the few-percent level.

• With the explicit knowledge of the Sherpa branching fractions we were able to extract Higgs boson
production rates at LO from the Sherpa signal cross section calculations according to

    \sigma^{\rm LO}_{gg \to h} = \frac{\Gamma_h}{\Gamma(h \to e\nu_e pp)} \, \sigma^{\rm LO}_{e\nu_e pp} .    (27)

The numbers that we obtained from this procedure compare well to those of other LO calculations,
for example the LO rates as evaluated in MCFM or provided by Becher and Yang for
verification purposes [93].



• Sherpa LO rates were computed for both finite top masses and in the infinite top-mass limit. The
ratio of the former over the latter cross section, given as

    \frac{\sigma^{\rm LO}_{gg \to h}}{\sigma^{(0)}_{gg \to h}} = \left| I\!\left( \frac{m_t^2}{M_h^2} \right) \right|^2 ,    (28)

singles out the dependence on the top mass versus Higgs boson mass ratio, which is encoded by the
function

    I(x) = 6x + 3x\,(4x - 1)\,F(x) ,    (29)

with the standard one-loop function F(x). Note that I(x) attains unity as x → ∞, while it vanishes for x → 0. Comparing the numerical
cross section ratios with the analytical values for |I(x)|², we found excellent agreement over the
entire Higgs boson mass range considered in this study.
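
The limiting behaviour of the top-mass function can be checked numerically with a few lines of code. The sketch below assumes the standard one-loop form I(x) = 6x + 3x(4x − 1)F(x) with x = m_t²/M_h², where F(x) = −2 arcsin²(1/(2√x)) for x ≥ 1/4 and its analytic continuation below; these explicit expressions and the top mass value used in the example are assumptions made for illustration, not taken from the text above.

```python
import math


def loop_f(x):
    """One-loop function F(x), x = m_t^2 / M_h^2 (assumed standard form)."""
    if x >= 0.25:
        return complex(-2.0 * math.asin(1.0 / (2.0 * math.sqrt(x))) ** 2)
    r = math.sqrt(1.0 - 4.0 * x)
    # Analytic continuation below the top-pair threshold (complex-valued).
    return 0.5 * (math.log((1.0 + r) / (1.0 - r)) - 1j * math.pi) ** 2


def loop_i(x):
    """I(x) = 6x + 3x(4x - 1)F(x); tends to 1 for x -> infinity and to 0 for x -> 0."""
    return 6.0 * x + 3.0 * x * (4.0 * x - 1.0) * loop_f(x)


def finite_top_mass_ratio(m_h, m_t):
    """Ratio |I(m_t^2 / m_h^2)|^2 of finite-top-mass over infinite-top-mass LO rates."""
    return abs(loop_i((m_t / m_h) ** 2)) ** 2


# Example with an assumed top mass of 172.5 GeV.
ratio_120 = finite_top_mass_ratio(120.0, 172.5)
```

For Higgs boson masses well below 2 m_t the ratio stays within a few percent of unity, growing as M_h approaches the top-pair threshold.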



A.2 NLO calculations versus CKKW ME+PS merging

When compared to NLO calculations, it is evident that Sherpa's CKKW merging approach does not 
account for the virtual corrections to V+jets in their entirety.^18 The only contributions enter through 
Sudakov form-factor terms at leading-logarithmic accuracy, which are used in the parton shower and to 
reweight the tree-level matrix elements. The real-emission corrections, however, are included at a level 
fairly comparable to full NLO calculations.^19 Unless one opts for a fixed-scale choice at NLO, both 
approaches determine the strong-coupling scales dynamically, i.e. on an event-by-event basis, taking the 
kinematic configuration of the event into account. For all these reasons, the CKKW shapes of 
distributions emerge in many cases quite similar to those evaluated at NLO, making an application of 
global K-factors feasible.^20 The treatment to fix the strong couplings is different when multiple 
scales are present. While at NLO the scales are set uniformly such that all α_s factors obtain the same 
value, in the CKKW method they are set locally by the procedure itself, cf. Refs. [68, 102, 70] for example. 

^18 For a brief summary of the basics of the CKKW merging, see Ref. [70]. For the current generation of SHERPA Monte Carlo 
programs, the ME+PS facilities have been extended to allow for truncated showering, which is a major refinement over the 
CKKW approach, see Refs. [76, 94, 95]. 

^19 More recent versions of SHERPA, from version 1.2.3 on, have been enhanced by the means to generate, for a number of 
important processes, events at the hadron level with a rate correct at next-to-leading order in α_s [83, 84, 96]. To make this 
work, SHERPA relies on interfacing external one-loop amplitude generators like MCFM [61, 97], BlackHat [98, 99] or, more 
generally, via the Binoth Les Houches Accord [100]. Because of the complexity of the procedure, such improvements are not 
yet available for arbitrary processes, in particular the multi-jet final states we are interested in. 

^20 For example, in [25] a global K-factor of magnitude 1.33 with respect to the total inclusive cross section as measured by 
CDF [101] was applied to achieve a good agreement between the data and the SHERPA predictions for inclusive jet multiplicity 
and transverse momentum distributions. 
This is known as α_s reweighting, constituting the second component of the matrix-element reweighting of 
the CKKW method. The assignment of the scales proceeds hierarchically based on the splitting history, 
which is identified by the k_T-jet cluster algorithm when applied to the initial matrix-element 
configuration considering physical parton combinations only. The nodal k_T values found by the clustering 
can be interpreted as the relative transverse momenta of the identified splittings. They are then used as 
the scales for the strong-coupling constants, replacing the predefined choice of the initial matrix-element 
generation. It would be interesting to see if a hierarchical scale setting can further stabilize NLO 
results, but no such α_s reweighting has been completely worked out yet for NLO calculations. 
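To make the α_s reweighting step concrete, the following Python sketch computes the weight that results when each strong-coupling factor of a tree-level matrix element is re-evaluated at a nodal k_T scale instead of the fixed generation scale. It is a toy under stated assumptions and not Sherpa's implementation: a one-loop running coupling with five active flavours and α_s(M_Z) = 0.118 is assumed, and the function names as well as the example scales are hypothetical.

```python
import math

ALPHA_S_MZ = 0.118   # assumed reference value of alpha_s at the Z mass
M_Z = 91.1876        # Z boson mass in GeV
N_F = 5              # assumed number of active quark flavours


def alpha_s_one_loop(q):
    """One-loop running strong coupling at the scale q (in GeV)."""
    b0 = (33.0 - 2.0 * N_F) / (12.0 * math.pi)
    return ALPHA_S_MZ / (1.0 + ALPHA_S_MZ * b0 * math.log(q * q / (M_Z * M_Z)))


def alpha_s_weight(nodal_kts, mu_me):
    """Product of alpha_s(k_T,i) / alpha_s(mu_ME) over all clustering nodes."""
    weight = 1.0
    for kt in nodal_kts:
        weight *= alpha_s_one_loop(kt) / alpha_s_one_loop(mu_me)
    return weight


# Hypothetical example: two QCD emissions clustered at k_T = 15 and 40 GeV,
# with the matrix element generated at a fixed scale of 80 GeV.
w = alpha_s_weight([15.0, 40.0], mu_me=80.0)
print(f"alpha_s reweighting factor: {w:.3f}")
```

Since the nodal scales in this example lie below the generation scale, the weight comes out larger than one; configurations with harder splittings are weighted down accordingly.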



B Appendix: Analysis side studies and additional material 
B.1 Ideal Higgs boson reconstruction analyses 

We first complete the presentation of our main results of Section 4.2 by showing Tables 7 and 8, where 
we display the numbers associated with the high and low Higgs boson mass regions, respectively. As in 
Table 4, we list the signal and W+jets background cross sections, selection efficiencies, S/B ratios and 
significances at different analysis levels. All cuts, the selection procedures, the layout of the tables and the 
interpretation of the results given in the tables have been discussed in detail in Section 4.2. Note that the 
rightmost column of Table 10 carries the outcomes for the test mass point M_h = 220 GeV. 

One remark shall be added regarding the magnitude of off-shell effects. The loss one faces due to 
off-shell Higgs bosons can be read off Table 8 by comparing the acceptances after baseline (1st rows) 
and combinatorial Higgs boson selection (4th rows). While above the WW mass threshold the loss 
on the signal (background) is mild (significant), ranging from 1.2-1.3 (2.6-3.2), it steadily increases for 
decreasing M_h, approaching 1.8 and 5.3 at M_h = 130 GeV and M_h = 110 GeV, respectively. The 
background loss factors turn huge (up to 48) because of the steeply falling m_{eν_e jj'} spectrum 
(cf. Figure [?]), but this cannot overcome the smallness of S/√B due to the signal reduction. 
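The interplay of the two loss factors can be checked with one line of arithmetic: if the signal acceptance drops by a factor l_S and the background acceptance by a factor l_B, then S/√B changes by √l_B / l_S. The Python sketch below evaluates this relation. The function name is hypothetical, and pairing the signal loss of 1.2 with the background loss of 2.6, as well as the signal loss at M_h = 110 GeV (5.3) with the largest background loss (48), is an assumption made here purely for illustration.

```python
import math


def significance_change(loss_signal, loss_background):
    """Factor by which S/sqrt(B) changes for S -> S/loss_signal, B -> B/loss_background."""
    return math.sqrt(loss_background) / loss_signal


# Loss factors quoted in the text; the pairings are illustrative assumptions.
above_threshold = significance_change(1.2, 2.6)
far_below_threshold = significance_change(5.3, 48.0)

print(f"above threshold:     x{above_threshold:.2f}")
print(f"far below threshold: x{far_below_threshold:.2f}")
```

Under these assumed pairings the relative change in S/√B is of similar size in both regimes, so the huge background suppression below threshold is largely used up in compensating the signal loss.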

We now present the results of our side studies, which we decided to put in an appendix in order not to 
disrupt the flow of the main body. 

Table 7: Impact of the different levels of cuts on the eν_e+jets final states for the gg → h → WW production and 
decay signal and the W+jets background as obtained from Sherpa. Cross sections σ_S, σ_B, acceptances ε_S, ε_B and 
S/B, S/√B ratios are shown for Higgs boson masses of M_h = 190, 200 and 210 GeV. Note that m̃_ij = m_ij/GeV 
and δ̃ = δ/GeV. Significances were calculated using Eqs. (23), assuming L = 10 fb^-1 of integrated luminosity, 
counting both electrons and muons and combining Tevatron experiments. 

Table 8: Impact of the different levels of cuts on the eν_e+jets final states for the gg → h → WW production 
and decay signal and the W+jets background as obtained from Sherpa. Cross sections σ_S, σ_B, acceptances ε_S, 
ε_B and S/B, S/√B ratios are shown for Higgs boson masses below the on-shell diboson mass threshold from 
M_h = 110 GeV to M_h = 150 GeV. Note that m̃_ij = m_ij/GeV and δ̃ = δ/GeV. The significances were 
calculated according to Eqs. (23), assuming L = 10 fb^-1 of integrated luminosity, counting both electrons and 
muons and combining Tevatron experiments. The layout of the table is the same as in Tables 4 and 7; however, a 
smaller number of Higgs boson candidate selections is shown. 

Figure 11: S/√B significances as a function of injected Higgs boson masses varying from M_h^inj = 165 GeV to 
200 GeV for different mass window parameters Δ and δ. Results of two combinatorial analyses based on the invm 
reconstruction are shown: one using the default setting M_h = M_h^inj (dashed lines), the other where the hypothesized 
Higgs boson mass is fixed at M_h = 180 GeV (solid lines). For the different injected Higgs boson masses, the 
eν_e+jets final states are generated from the gg → h → WW signal and the W+jets production background. All 
significances were calculated according to Eqs. (23), taking only the dominant background into account and under 
the assumption of an integrated luminosity of L = 10 fb^-1, including electron and muon channels, i.e. with a 
channel factor of 2. 

The Higgs boson masses used in the analyses are, of course, hypothetical. However, we only considered 
the obvious scenario where the test mass M_h has been set equal to the Higgs boson mass M_h^inj injected 
while generating the signal predictions. It is clear that one scans over a range of masses when pursuing 
an analysis, and here we do it in steps of 10 GeV; still, the actual Higgs boson mass could deviate by as much 
as 5 GeV from the assumed mass. Hence, we want to briefly study how strongly the invm Higgs boson 
candidate selections and their related significances depend on the match between the test and injected 
Higgs boson mass. To this end we generated signal predictions other than the default ones for Higgs boson 
masses of M_h^inj = 165, 175, 185, 195 GeV and input them into the analyses using M_h = 180 GeV. Figure 11 
shows the outcome of this side study where, for both types of analyses, we collected results for several 
mass window settings. We learn two things from plotting the significance as a function of the Higgs boson 
generation mass. First, the significances that we attain if we keep the selection parameters (M_h, Δ, δ) 
constant are fairly robust over a broader range of generation masses. Yet the maximum S/√B occurs for 
M_h = M_h^inj. Secondly, we learn that asymmetric window placements such that M_h < M_h^inj are beneficial to 
achieve significance gains. By focusing on a single generation mass, e.g. M_h^inj = 190 GeV, we see that 
the response in significance can easily get as large as 40%. The strong sensitivity can be understood by 
comparing the signal and background shapes as visualized in Figure 3 or the upper left plot of Figure 1. 
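The mass-window selection and its sensitivity to a mismatch between test and injected mass can be illustrated with a few lines of Python. This is only a sketch: the function names are hypothetical, the window is taken as (M_h − Δ_low, M_h + Δ_up) with Δ_low = Δ_up = Δ in the symmetric case, and the list of reconstructed masses is an invented toy sample, not simulation output.

```python
def in_mass_window(mass, m_test, delta_low, delta_up=None):
    """True if mass lies in the open window (m_test - delta_low, m_test + delta_up)."""
    if delta_up is None:          # symmetric window of half-width delta_low
        delta_up = delta_low
    return m_test - delta_low < mass < m_test + delta_up


def window_fraction(masses, m_test, delta_low, delta_up=None):
    """Fraction of candidates whose reconstructed mass falls into the window."""
    if not masses:
        return 0.0
    kept = sum(1 for m in masses if in_mass_window(m, m_test, delta_low, delta_up))
    return kept / len(masses)


# Invented toy sample (GeV): a peak near 190 GeV with a tail towards lower masses.
toy_masses = [150.0, 172.0, 179.0, 181.0, 186.0, 188.0, 190.0, 191.0, 196.0]

matched = window_fraction(toy_masses, m_test=190.0, delta_low=10.0)
mismatched = window_fraction(toy_masses, m_test=180.0, delta_low=10.0)
asymmetric = window_fraction(toy_masses, m_test=190.0, delta_low=12.0, delta_up=8.0)
print(matched, mismatched, asymmetric)
```

The signal fraction alone does not fix the significance, since the background populating the same window enters through √B; the sketch only shows how the window parameters (M_h, Δ) and an asymmetric placement act on a given mass spectrum.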

In the remainder of this appendix, we outline the impact of PDF variations and parameter variations 
other than M_h, Δ and δ on our analyses. 

The Sherpa calculations resulting from using the CTEQ6.6 PDF libraries give a similar pattern with 
significances that are about 1-10% larger. This can be read off Table 9 and seen in Figure [?]. By 
normalizing the Monte Carlo predictions for both PDF choices, MSTW2008 and CTEQ6.6, to the same respective 
theory cross sections, it is altogether reassuring to see that the cut and event selection procedures only 
induce deviations of the order of 10% or below. For the combinatorial Higgs boson candidate selection, one 
finds almost the same S/√B ratios, provided a wide Higgs boson mass window is used. The significances 
of the CTEQ6.6 calculations outperform those obtained with MSTW2008 once the mass windows for the 
Higgs bosons (Δ) and dijets (δ) are tightened. We find the differences to be more pronounced for Higgs 
boson masses just above the WW threshold. 

Table 9: Impact of the different levels of cuts on the eν_e+jets final states for the gg → h → WW production 
and decay signal and the W+jets background as obtained from Sherpa when using the CTEQ6.6 PDF libraries. 
Cross sections σ_S, σ_B, acceptances ε_S, ε_B and S/B, S/√B ratios are shown for Higgs boson masses from M_h = 
165 GeV to M_h = 210 GeV. Note that m̃_ij = m_ij/GeV and δ̃ = δ/GeV. All significances were calculated 
according to Eqs. (23), assuming L = 10 fb^-1 of integrated luminosity, counting both electron and muon channels 
and combining Tevatron experiments. 

Speaking of shape differences triggered by the use of different PDFs, we note that the MSTW2008 
PDF set accounts for a larger transverse activity, i.e. rapidity distributions turn out steeper while the pT 
spectra develop 10-25% harder tails compared to the CTEQ6.6 predictions. Also, for MSTW2008, mass 
peaks are more washed out, again resulting in differences of the order of 25%. 

Lastly, apart from the rightmost column, Table 10 gives more details for the choice M_h = M_h^inj = 
180 GeV, including the cases where either the dijet mass window has been tightened from δ = 20 GeV to 
δ = 15 GeV or the threshold of the jet transverse momenta has been raised to p_T^jet > 30 GeV. 

Table 10: Impact of the different levels of cuts on the eν_e+jets final states for the gg → h → WW production 
and decay signal and the W+jets background as obtained from Sherpa. Cross sections σ_S, σ_B, acceptances ε_S, ε_B 
and S/B, S/√B ratios are given for Higgs boson masses of M_h = 180 GeV and M_h = 220 GeV. For the former 
mass point, the central column shows the values for a jet p_T threshold increased by 10 GeV, while the left column 
has the values for a smaller dijet mass window of δ = 15 GeV. A further decrease of the dijet mass window to 
δ = 10 GeV yields S/B = 0.0034 as well as S/√B = 1.14 and S/B = 0.0058 as well as S/√B = 1.36 for 
the combinatorial Higgs boson reconstruction and mass windows of Δ̃ = 50 and Δ̃ = 20, respectively. Note that 
δ̃ = δ/GeV; all other mass variables denoted by a tilde are understood in the same way. All significances were 
calculated according to Eqs. (23), assuming L = 10 fb^-1 of integrated luminosity, counting both electron and muon 
channels and combining Tevatron experiments. 

B.2 More realistic Higgs boson reconstruction analyses 

We collect here in this appendix additional material to substantiate our findings presented in Section 4.4. 
Figures 12-18 display a number of distributions resulting from the baseline plus combinatorial analyses. 
In each plot we show four curves, two predictions each for the signal and the W+jets background as 
obtained after the ideal (invm) and one of the more realistic combinatorial selections. While we vary 
the Higgs boson masses and the choice of the realistic h reconstruction method, we keep the window 
parameters of the combinatorial selections fixed at Δ̃ = Δ/GeV = 25 and δ̃ = δ/GeV = 20 over the 
whole set of spectra shown in these figures. 

In Figure 19 we look at distributions taken at the intermediate optimization level, including the effects 
of the major cuts but before the application of the respective minor cuts as listed in Table 6. In these 
plots we compare the Monte Carlo predictions for the Higgs boson signals of different M_h with all, 
dominant and subdominant, background predictions. All these predictions follow from using realistic 
selection procedures where Higgs boson and dijet mass windows of Δ̃ = δ̃ = 20 have been employed. 

Each one-dimensional distribution is supplemented by a one-minus-ratio subplot. The first prediction 
in the respective legend is always taken as the reference, which we have arranged to be a W+jets prediction 
for all plots shown here. The ratio subplots nicely visualize why we place the cuts as given in Table 6. 
Note that in all plots we only compare the shapes, i.e. all distributions are normalized to unit area. 
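The construction of such a subplot can be sketched in Python as follows. The helper names and the bin contents are hypothetical, and we assume that "one minus ratio" means 1 − (prediction / reference), evaluated bin by bin after both histograms with equal-width bins have been normalized to unit area; the convention actually used for the figures may differ in detail.

```python
def normalize(hist):
    """Scale bin contents so that they sum to one (unit area for equal-width bins)."""
    total = float(sum(hist))
    if total == 0.0:
        raise ValueError("cannot normalize an empty histogram")
    return [h / total for h in hist]


def one_minus_ratio(prediction, reference):
    """Bin-wise 1 - prediction/reference for unit-normalized histograms.

    Bins where the reference is empty are returned as None.
    """
    pred = normalize(prediction)
    ref = normalize(reference)
    return [(1.0 - p / r) if r > 0.0 else None for p, r in zip(pred, ref)]


# Invented bin contents: the W+jets reference falls, the signal rises.
w_jets = [40.0, 30.0, 20.0, 10.0]
signal = [10.0, 20.0, 30.0, 40.0]

shape_difference = one_minus_ratio(signal, w_jets)
print(shape_difference)
```

Positive entries mark bins where the reference (here W+jets) dominates in shape, negative entries mark bins enriched in the compared prediction, which is the information one reads off when deciding where to place a cut.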

We start by showing the |Δη_jj'| distributions in Figure 12. The differences in the results of the ideal 
and more realistic selections are immaterial; there are essentially no differences in the above-threshold 
cases. Furthermore, the shapes are very stable under variations. We see that placing a cut around 
|Δη_jj'| = 1.5 keeps most of the signal, while it removes a large fraction of the W+jets events. We note 
that it is only the W+jets background that features a peak location away from zero; all other backgrounds 
(not shown here) behave similarly to the signal. 
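A sketch of how such a cut acts on a pair of selected jets is given below. The jet representation (transverse momentum p_T and longitudinal momentum p_z, in GeV), the helper names and the example momenta are assumptions made for illustration; only the cut value of 1.5 is taken from the text, and we assume that events are kept if |Δη_jj'| lies below it.

```python
import math


def pseudorapidity(pt, pz):
    """Pseudo-rapidity eta = asinh(p_z / p_T) of a momentum with p_T > 0."""
    return math.asinh(pz / pt)


def passes_delta_eta_cut(jet1, jet2, max_delta_eta=1.5):
    """Keep the jet pair if |eta_1 - eta_2| is below the cut value."""
    eta1 = pseudorapidity(*jet1)
    eta2 = pseudorapidity(*jet2)
    return abs(eta1 - eta2) < max_delta_eta


# Hypothetical jets given as (p_T, p_z) in GeV.
central_pair = ((45.0, 10.0), (30.0, -12.0))   # both jets central
forward_pair = ((25.0, 60.0), (22.0, -40.0))   # jets in opposite hemispheres

print(passes_delta_eta_cut(*central_pair), passes_delta_eta_cut(*forward_pair))
```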

When working with the reconstruction methods, pzmh/w, we have access to another longitudinal 
variable: we can include a cut on the h candidate's pseudo-rapidity to supplement the constraints from the 
major and |Δη_jj'| cuts. Figure 13 displays various η_{eν_e jj'} distributions. Again, all predicted shapes are 
rather independent of the choice of the test mass M_h. The W+jets background (as well as the electroweak 
background, which is not shown here) tends to populate the forward rapidities, while the signal 
(and the tt̄ contribution, also not shown here) shows up more centrally. This leads us to require 
|η_{eν_e jj'}| ≤ 3.0, as pointed out in Section 4.4, to achieve additional significance gains. Deviations 
between the ideal and more realistic reconstructions become visible; they now are O(25%), clearly because 
one needs information about the neutrino to form this observable. We observe that the W+jets background 
receives the larger corrections compared to the signal. 

We add one more comment regarding longitudinal quantities. In this study, as stated in Section 4.4, 
the pseudo-rapidity variables, which we discussed above, occur largely uncorrelated with the transverse 
observables as well as invariant masses. Schematically, we illustrate this on the basis of two-dimensional 
|Δη_jj'| versus m_{eν_e jj'} distributions for both the Higgs boson signal and the W+jets background. In 
Figure 14 we show these distributions as resulting from the combinatorial pzmw reconstruction for two 
different Higgs boson masses, below (M_h = 140 GeV) and above (M_h = 180 GeV) the diboson mass 
threshold. Similarly, Figure 15 exhibits the results obtained with the mt selection where the m_{eν_e jj'} 
quantities were reconstructed as in the ideal case. When confronted with the respective W+jets backgrounds, 
we notice that the predictions originating from the production of Higgs bosons cover rather different 
parameter regions in the m_{eν_e jj'} - |Δη_jj'| plane. This happens independently of the value of M_h and the 
chosen combinatorial selection. Based on these observations, we expect the total S/√B increase to 
almost completely factorize into a product of single S/√B improvement factors. 

Figure 12: Pseudo-rapidity difference between the two selected jets. Predictions for gg → h → eν_e+jets 
production obtained after invm (dashed) and more realistic (solid) combinatorial selections (top: pzmh, center: pzmw, 
bottom: mt) are compared with each other and to the corresponding predictions for the W(→ eν_e)+jets background. 
Left panes show results for M_h = 150 GeV; in all other cases M_h = 190 GeV was used. 

Figure 13: Pseudo-rapidity of the selected {e, ν_e, j, j'} set, the Higgs boson candidate. The predictions for 
gg → h → eν_e+jets production (coloured lines) obtained after the combinatorial invm (dashed) and more realistic 
h reconstruction (solid) selections (upper: pzmh, lower: pzmw) are compared with each other and to the 
corresponding predictions of the W(→ eν_e)+jets background (black lines). Left panes show results for M_h = 150 GeV, 
while in the right panes the outcomes for M_h = 190 GeV are shown. 
Figure 14: Two-dimensional distributions showing the selected-jet pseudo-rapidity difference, |Δη_jj'|, plotted 
versus the reconstructed mass m_{eν_e jj'} of the selected {e, ν_e, j, j'} combinations. The predictions for 
gg → h → eν_e+jets production obtained after pzmw reconstruction of Higgs boson candidates are compared with each 
other and to the corresponding predictions given by the W(→ eν_e)+jets background. The upper (lower) plots represent 
the signal (background) predictions, while the left (right) panes show results for M_h = 140 (180) GeV. 



In Figure 16 we depict examples of potential discriminators below the diboson mass threshold. From 
the upper left to the lower right we present, for the choice of M_h = 130 GeV, the transverse mass m_{T,eν_e} 
of the leptonically decaying W, the scalar p_T sum H_{T,jj'} of the selected jets, the transverse masses of the 
selected 4-particle set, m_{T,eν_e jj'}, and of the selected jets, m_{T,jj'}, as well as their corresponding 
invariant masses, m_{eν_e jj'} and m_{jj'}. The first three plots in that order are the leading cut variables for 
pzmw, mt and pzmh, followed by the subleading cut variables for mt and pzmh, cf. Table 6. In the lower right 
plot we display the subsubleading cut variable for the mt method, m_{jj'}, which would yield a significance 
gain of 19% if we were to demand m_{jj'} > 72 GeV. Unlike the transverse observables depicted in the 
upper and center panes of Figure 16, the invariant-mass distributions are affected by the modifications of 
the ideal selection owing to a realistic neutrino treatment. The selected 4-object invariant mass (lower 
left) exemplifies to what degree shapes can get distorted by the pzmh approach. The shoulder above 
M_h = 130 GeV emerges because complex solutions cannot be completely avoided in the reconstruction 
of the neutrino momenta; the lower tail arises from the m_{T,eν_e jj'} constraints on the target mass 
m_{eν_e jj'}, see Section 4.3. 

Figure 15: Two-dimensional distributions showing the selected-jet pseudo-rapidity difference, |Δη_jj'|, plotted 
versus the reconstructed mass m_{eν_e jj'} of the selected {e, ν_e, j, j'} combinations. The predictions for 
gg → h → eν_e+jets production obtained after combinatorial selection according to the mt procedure are compared 
with each other and to the corresponding predictions given by the W(→ eν_e)+jets background. The upper (lower) plots 
represent the signal (background) predictions, while the left (right) panes show results for M_h = 140 (180) GeV. 
Note that for the purpose of illustration, the m_{eν_e jj'} quantities are reconstructed as in the ideal case. 

The 2-particle transverse masses shown in the upper left and center right of Figure 16 together with the 
m_{jj'} spectra document why we exploit the signal's preference for on-shell hadronic and off-shell leptonic 
decays of the W bosons. All backgrounds considered in this study disfavor this correlation. The 4-particle 
transverse mass (center left) exhibits - as expected - a nice kinematic edge for the signal at 
m_{T,eν_e jj'} = M_h, while all backgrounds are continuous in this variable, reaching their broad maxima above 
the applied mass window. The features of the H_{T,jj'} handle (upper right) have already been described in 
Section 2. For this variable, the electroweak background turns out signal-like whereas the tt̄ background 
generates (by far) the hardest tails. 
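For orientation, the two simplest transverse observables of this kind can be written down in a few lines of Python. The precise definitions used in the analysis are given in the main body and are not repeated in this appendix, so the sketch assumes the standard forms: the two-body transverse mass of massless objects, m_T^2 = 2 p_T1 p_T2 (1 − cos Δφ), and the scalar sum of transverse momenta for H_T. Function names and example momenta are hypothetical.

```python
import math


def transverse_mass(pt1, phi1, pt2, phi2):
    """Transverse mass of two massless objects from their p_T (GeV) and azimuth (rad)."""
    return math.sqrt(2.0 * pt1 * pt2 * (1.0 - math.cos(phi1 - phi2)))


def scalar_pt_sum(pts):
    """H_T-like scalar sum of transverse momenta (GeV)."""
    return sum(abs(pt) for pt in pts)


# Hypothetical lepton and missing-momentum vectors, back-to-back in azimuth.
m_t_lepton_neutrino = transverse_mass(40.0, 0.0, 40.0, math.pi)

# Hypothetical transverse momenta of the two selected jets.
h_t_dijet = scalar_pt_sum([35.0, 28.0])

print(m_t_lepton_neutrino, h_t_dijet)
```

With these definitions a lepton-neutrino pair that is collinear in azimuth has vanishing transverse mass, while a back-to-back pair reaches the maximum 2√(p_T1 p_T2).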
Figure 16: Examples of leading and subleading cut observables below the 2 M_W threshold, for M_h = 130 GeV. 
Predictions are shown for the gg → h and W production of eν_e+jets final states using the invm (dashed) and 
more realistic (solid) combinatorial selections. Top to bottom, left panes: m_{T,eν_e} (pzmw leading), m_{T,eν_e jj'} and 
m_{eν_e jj'} (pzmh leading and subleading); right panes: H_{T,jj'}, m_{T,jj'} and m_{jj'} (mt leading to subsubleading). 

Figure 17: Examples of (sub)leading cut observables for Higgs boson masses M_h ≈ 2 M_W. Predictions are 
shown for the gg → h and W production of eν_e+jets final states using the invm (dashed) and more realistic 
(solid) combinatorial selections. Top to bottom, left panes: p_{T,eν_e jj'} and m_{eν_e jj'} (pzmh leading and subleading), 
H_{T,eν_e jj'} (mt subleading); right panes: m_{T,eν_e jj'}, m_{eν_e jj'} and H_{T,jj'} (pzmw leading to subsubleading). 

Figure 18: Leading and subleading cut observables for M_h choices well above the 2 M_W threshold. Predictions 
are shown for the gg → h and W production of eν_e+jets final states using the invm (dashed) and more realistic 
(solid) combinatorial selections. Upper panes: H_{T,eν_e jj'} (leading, pzmh (left) and mt); center panes: p_{T,eν_e jj'} 
(subleading, pzmh (left) and pzmw); lower panes (subleading), left: p_{T,eν_e} (pzmh), right: m_{T,eν_e jj'} (mt). 
We now discuss some of the near-threshold discriminators where most of the examples are given for the 
test point M_h = 170 GeV. Figure 17 shows in its top row the two leading cut variables that we found for 
this mass region. In the upper right we have the transverse mass m_{T,eν_e jj'} of the selected {e, ν_e, j, j'} 
objects as resulting after the pzmw reconstruction with M_h = 150 GeV. We discover properties very 
similar to those discussed for the case of low Higgs boson masses, cf. Figure 16. In the upper left we have 
depicted the first-rank discriminator for medium Higgs boson masses - the selected 4-object transverse 
momentum distribution. It was introduced early on in Section 2. Here, we present the p_{T,eν_e jj'} 
distribution as obtained after the combinatorial pzmh selection. We remark that the curves of the other two 
realistic approaches, pzmw and mt (both not shown here), deviate even less from the respective curves of 
the ideal selection. Also not shown in Figure 17 but worth mentioning, the electroweak background 
would yield spectra similar to those of the W+jets background, whereas the top-pair production would turn 
up significantly harder than the signal's p_T spectra. 

The other example plots of Figure 17 are chosen from the set of second-leading cut variables. The 
center panes display the invariant mass distributions m_{eν_e jj'} resulting from the pzmh/w selections. As 
opposed to the - by construction - sculpted shapes of the pzmh method, we observe that the pzmw 
selection reproduces the invariant mass shapes of the invm ideal reconstruction to a large extent. The 
lower panes demonstrate the potential possessed by scalar transverse momentum sums that we exemplify 
by means of the selected-set H_{T,eν_e jj'} observable as given by the mt selection (lower left) and the 
selected-jet H_{T,jj'} observable resulting from the pzmw reconstruction (lower right). Remarkably, based on 
the 4-object H_T, we can achieve an even clearer separation between the signal and the W+jets background 
once we select h candidates according to the mt method. 

Figure 18 summarizes the types of discriminating observables as identified in Table 6 for the region 
of large Higgs boson masses; we use M_h = 210 and 220 GeV in the example plots. The upper panels 
represent H_{T,eν_e jj'} distributions obtained after pzmh (left) and mt combinatorial selections. Between 
these two cases we detect only marginal differences, and similarly between the predictions of the ideal 
and pzmw reconstructions (not shown here). At large Higgs boson masses, the signal develops its peak 
at considerably larger H_T values compared to the W+jets background. This characterizes the selected-set 
H_T as the strongest handle we have above the WW threshold. Moreover, the pzmh and mt realistic 
selections further enhance the separation between the two peak regions. For the subdominant backgrounds 
(not shown here), we noticed a strong similarity between the H_T spectra arising from the electroweak 
production and the W+jets background. The tt̄ background, however, yields the hardest spectra both in 
terms of the peak position and the tail of the H_T distributions. 

The center panes of Figure 18 show two examples of selected h candidate p_T distributions. These 
variables do not constitute the best discriminators anymore, but still quite often rank second best in 
separating signal from W+jets production in the domain of large M_h. As the pzmw reconstruction works 
extremely well for heavy Higgs boson decays into on-shell W bosons, it gives p_{T,eν_e jj'} shapes almost 
identical to those of the ideal selection. 

For the M_h = 220 GeV point, we present two different subleading cut variables in the lower panes of 
Figure 18. To the left, one finds the p_{T,eν_e} distribution of the reconstructed W^- resulting after selecting 
Higgs boson candidates according to pzmh. The purely transverse mass m_{T,eν_e jj'} of the mt-selected 
4-object set, cf. Eq. (8), is depicted on the lower right. For these variables, we recognize similar features 
as for the H_{T,eν_e jj'} spectra concerning the minor backgrounds and the comparison with the invm Higgs 
boson candidate reconstruction. 
Figure 19: Examples of minor cut observables after application of major cuts and rapidity constraints, including 
spectra resulting from subdominant backgrounds. Predictions using the more realistic combinatorial h candidate 
selections are shown for eν_e+jets final states arising from gg → h (solid red), W+jets (solid black), tt̄ (dashed blue) 
and electroweak (dashed green) production at Tevatron Run II. See text for more details. 
In Figure 19 we give a brief overview of discriminators that lead to a further increase in significance 
after implementing all major and rapidity constraints discussed in Section 4.4. The boost factor γ_{jj'|eν_e}, 
introduced in Section 2, proves very helpful in separating signal from backgrounds over a large range of 
above-threshold Higgs boson masses. Since γ_{jj'|eν_e} develops a peak for the signal only, it is advantageous 
to isolate the boost factor peak region in order to exploit that a not too broad scalar resonance has been 
produced over a multitude of continuous backgrounds. This is exemplified in the upper plots of Figure 19, 
where we present two boost factor γ_{jj'|eν_e} example distributions for M_h = 165 GeV (left) and M_h = 
220 GeV when selecting via the pzmw method. 

For the pzmh method in particular, we identified two transverse variables that, if constrained from 
below, are very effective in the low Higgs boson mass region, even at this more involved level of the 
analysis. We exhibit these variables, H_{T,jj'} on the left and m_{T,jj'} on the right, in the center panes of 
Figure 19, choosing M_h = 130 GeV. As in similar cases, the significance gain originates from exploiting 
the different peak locations associated with the signal, at higher values, and the dominant background, 
preferring the low values. 

For M_h very close above threshold, we can use the azimuthal angle between the lepton and MET or 
the two selected jets (as suggested by Han and Zhang in Refs. [7, 8]) to further suppress the backgrounds. 
A low-p_T Higgs boson produced at WW threshold gives rise to two longitudinally moving W bosons, 
which in turn each decay into two objects oriented almost back-to-back in the transverse plane. This is 
demonstrated by the example plot in the lower left of Figure 19, where signal and background distributions 
are shown for the Δφ observable when employing the mt selection for M_h = 165 GeV. 
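The azimuthal separation used here needs to be folded into the range [0, π] before a back-to-back requirement can be imposed. A minimal Python sketch is given below; the function names are hypothetical and the tolerance of 0.3 rad is an arbitrary illustrative value, not a cut taken from the analysis.

```python
import math


def delta_phi(phi1, phi2):
    """Azimuthal separation of two directions, folded into [0, pi]."""
    dphi = abs(phi1 - phi2) % (2.0 * math.pi)
    return 2.0 * math.pi - dphi if dphi > math.pi else dphi


def is_back_to_back(phi1, phi2, tolerance=0.3):
    """True if the two directions are within `tolerance` (rad) of being opposite."""
    return delta_phi(phi1, phi2) > math.pi - tolerance


# Two directions on either side of the phi = 0 / 2*pi boundary are close together.
print(delta_phi(0.1, 2.0 * math.pi - 0.1))

# Hypothetical lepton and missing-momentum directions, nearly opposite.
print(is_back_to_back(0.0, 3.0))
```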

Finally, we display in the lower right of Figure 19 an example of a global-event observable, namely H_T 
as calculated from the entire event, not vetoed by the selection and major cuts. It illustrates the hierarchy of 
scales intrinsic to the heavy Higgs boson signal (M_h = 220 GeV) and different background processes; it 
also visualizes the leftover potential when considering the global H_T for the implementation of additional 
cuts, see Appendix B.3. 

B.3 Directions for additional improvements 

The final sets of events surviving our optimized combinatorial selections of Higgs boson candidates are 
perfect for use as input to a full multivariate analysis. This is primarily because the analysis types presented 
here were geared towards significance maximization, so that a sufficiently large number of events can be 
preserved. 

• S/B improvements: since we are rather safe from a statistics point of view, there is in many 
cases potential to improve S/B by simply imposing more restrictive constraints, accepting (mild) 
significance losses at the same time. The simplest way is to cut harder in the tails of the major 
observables; for instance, for pzmw at M_h = 220 GeV, using H_{T,eν_e jj'} ≥ 188 GeV (instead of the 
bound given in Table 6) results in a 40% gain in S/B while the significance only drops by 10%. 
Of course, a change in variable is sometimes more beneficial for maximizing S/B and keeping 
a reasonable significance. Consider for example mt at M_h = 140 GeV; hardening the H_{T,jj'} 
constraint by cutting out the region below 92 GeV maximizes S/B by doubling it, but reduces the 
significance by a factor of 3.4. In contrast, using m_{T,jj'} > 72 GeV gives a factor 1.6 increase in 
S/B, yet only a factor 1.2 decrease in significance. Certainly one can opt for the (other) extreme 
and totally maximize S/B, e.g. for the pzmw case just mentioned, we find a huge S/B gain of 600% 
by demanding p_{T,jj'} > 64 GeV, but we actually just traded a reasonable significance associated with 
half a percent S/B for a ~ 4% S/B of very low significance, diminished by a factor of 6.5. This 
behaviour is typical owing to the limited amount of data taken at the Tevatron. 

• Overall H_T cut: the philosophy of this study is to only constrain variables involving the candidate 
set of particles. Allowing cut observables sensitive to the whole event structure can lead to additional 
significance improvements, but the related uncertainties are larger since such observables are more 
prone to hard radiative corrections that need to be described appropriately, often beyond 
parton-shower modeling. At the level of identifying minor cuts, as given in Table 6, we found that an 
overall Ht cut on the selected events yields in most cases significance gains of the order of 20^0% 
near and above threshold. This is a conservative estimate considering the larger uncertainties on 
such cuts. An example is shown in the lower right of Figure 19, where we see why one benefits by 
constraining H_T from below. In addition one could suppress the tt̄ background by introducing an 
upper H_T bound, or equally, exploit the fact that the leading jets in tt̄ production yield a substantially 
harder H_{T,2} = H_{T,j1j2} spectrum. 

• Asymmetric mass windows for reconstruction methods: as touched upon during the discussion of 
Figure 11, the use of asymmetric test mass windows, i.e. (M_h − Δ_low, M_h + Δ_up), can improve the 
realistic selections that either approximate or set the selected 4-object mass, m_{eν_e jj'}. For all M_h 
values, it is advantageous to choose Δ_low larger by a few GeV than Δ_up, where 2Δ = Δ_low + Δ_up. 
The results presented in Figure 1 (upper left) and Figure 3 clarify why this is a good idea: first, the 
Higgs boson resonance is deformed by radiative losses and resolution effects (there is no perfect jet 
finding, let alone (sub)event reconstruction), amounting to a larger portion of cross section below the 
peak; second, the backgrounds are either sufficiently flat or - below the WW threshold - steeply 
falling towards smaller masses, amplifying the asymmetry effect further. We checked the asymmetric 
window scenario for the pzmw method, where this led to significance gains of up to 10%. For the 
dijet mass windows, the effects turned out too weak. 
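The S/B versus significance trade-offs quoted in the first item of the list above follow from simple bookkeeping. If an additional cut keeps a fraction ε_S of the signal and ε_B of the background, S/B changes by a = ε_S/ε_B and S/√B by b = ε_S/√ε_B, so that ε_S = b²/a and ε_B = (b/a)². The Python sketch below inverts the quoted factors in this way. The function name is hypothetical, and reading "a 40% gain" as a = 1.4 and "drops by 10%" as b = 0.9 is our interpretation of the numbers in the text.

```python
def implied_efficiencies(sb_factor, significance_factor):
    """Signal and background efficiencies implied by the changes in S/B and S/sqrt(B)."""
    eff_signal = significance_factor ** 2 / sb_factor
    eff_background = (significance_factor / sb_factor) ** 2
    return eff_signal, eff_background


# (factor on S/B, factor on S/sqrt(B)) as read from the text.
examples = {
    "pzmw, harder H_T cut": (1.4, 0.9),
    "mt, H_T,jj' above 92 GeV": (2.0, 1.0 / 3.4),
    "mt, m_T,jj' > 72 GeV": (1.6, 1.0 / 1.2),
}

for label, (a, b) in examples.items():
    eff_s, eff_b = implied_efficiencies(a, b)
    print(f"{label}: keeps {eff_s:.1%} of S and {eff_b:.1%} of B")
```

Seen this way, doubling S/B at the price of a factor 3.4 in significance means discarding most of the signal, which is why such cuts are hard to afford with the limited Tevatron dataset.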